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Summary 

"  Homing  missile  guidance  is  formulated  as  an  optimal  stochastic  control 
problem  where  the  special  nonlinear  structure  of  the  missile- target  engage¬ 
ment  is  exploited.  Since  this  stochastic  control  problem  assumes  a  nested 
information  pattern,  the  filter  structure  can  be  developed  independently  of 
the  guidance  scheme.  However,  the  guidance  scheme  is  dependent  on  and 
affects  filter  performance.  Significant  progress  is  being  made  on  both  the 
estimation  problem  and  the  guidance  problem. 

""  Investigation  of  the  nonlinear  estimators  especially  tailored  to  the  hom¬ 
ing  missile  problem  has  produced  not  only  a  good  deal  of  insight  but  re¬ 
sponsive  and  mechanizable  schemes.  Although  these  schemes  are  applicable 
to  active  sensors,  our  emphasis  has  been  on  the  more  difficult  passive  sensor 
case  where  only  angles  are  available.  Recently-developed  schemes  based  on 
coordinate  transformations  and  on  an  assumed  probability  density  function 
perform  well,  but  the  modified-gain  extended  Kalman  filter  seems  to  be  the 
most  promising.  Furthermore,  this  filter  has  been  used  as  the  basis  of  a 
stochastic  adaptive  flight  control  scheme.  In  order  to  improve  this  class 
of  stochastic  control  schemes,  new  results  have  been  obtained  in  control 
synthesis  for  structured  plant  uncertainties. 

~  Two  important  current  efforts  in  missile  guidance  with  bearings-only 
information  are  in  the  development  of  the  guidance  schemes  that  enhance 
an  information  measure  by  trajectory  modulation  and  in  target  accelera¬ 
tion  detection.^  A  mechanizable  guidance  law  based  upon  linear-quadratic- 
Gaussian  theory  which  modulates  the  path  initially  to  enhance  the  infor¬ 
mation  measure  but  which  meets  terminal  miss  constraints  has  been  tested. 
Finally,  based  upon  deterministic  detection  filter  design  by  spectral  meth¬ 
ods,  a  detection  scheme  for  rapidly  detecting  target  motion  has  been  de¬ 
veloped  and  is  being  compared  with  current  designs. 

Research  Objectives  and  Status 

A  special  class  of  stochastic  control  systems  is  being  developed  for  the 
guidance  system  of  a  homing  missile  by  exploiting  the  special  nonlinear 


structure  of  the  missile-target  engagement.  Improvements  are  required 
in  the  current  guidance  law,  proportional  navigation,  because  the  guid¬ 
ance  system  degrades  under  initial  intercept  geometries  that  produce  large 
nonlinearities  about  the  homing  triangle  or  due  to  active  target  motion 
which  also  induces  large  nonlinearities.  Our  guidance  law  investigations 
have  emphasized  measurements  from  passive  sensors  for  which  only  bear¬ 
ing  information  is  available.  This  bearings-only  guidance  problem  is  most 
challenging  because  the  stochastic  controller  has  the  dual  role  of  enhancing 
filtler  performance  and  achieving  target  intercept  with  minimal  expected 
terminal  cost.  However,  this  problem  is  somewhat  simplified  since  the  sep¬ 
aration  theorem  in  the  sense  of  Witsenhausen  is  satisfied.  The  separation 
theorem  states  that  the  filter  structure,  given  the  classical  information  pat¬ 
tern,  is  independent  of  the  controller  structure  although  the  controller  is 
highly  dependent  on  the  predicted  filter  performance. 

Motivated  by  the  separation  theorem,  high-performance  estimators  have 
been  developed  which  are  tailored  to  the  special  nonlinearities  of  the  missile- 
target  engagement.  One  new  estimator,  called  the  modified-gain  extended 
Kalman  filter  (MGEKF),  is  applicable  to  two  important  engineering  prob¬ 
lems:  bearings-only  estimation  [1]  and  state  and  parameter  estimation  [2]. 
Although  we  consider  the  MGEKF  a  breakthrough  in  guidance  filter  devel¬ 
opment,  the  assumed-density  filter  [3]  and  the  coordinate- transformation- 
based  filter  [4]  have  also  shown  considerable  promise.  Since  the  conditional 
mean  estimator  is  infinite  dimensional,  the  finite-dimensional  MGEKF  is 
proposed  as  the  estimation  processor  for  the  homing  guidance  dual  con¬ 
troller.  Furthermore,  the  MGEKF  is  also  proposed  as  the  state  and  pa¬ 
rameter  estimator  for  an  explicit  adaptive  control  law  which  is  applicable 
to  flight  control  and  autopilot  design.  In  particular,  the  MGEKF  has  been 
applied  to  the  problem  of  on-line  state  estimation  and  the  identification 
of  aircraft  stability  derivatives  [5j.  An  adaptive  control  loop  using  this  es¬ 
timator  is  given  in  [6]  where  the  essential  parameter  required  is  moment 
coefficient  due  to  elevator  defection.  The  adaptive  gain  is  inveresely  pro¬ 
portional  to  this  parameter  which  seems  well  estimated  by  the  MGEKF 
even  in  moderately-high  clear  air  turbulence.  However,  more  elaborate 
controllers  will  be  required  for  bank-to-turn  missiles.  A  multivariable  syn¬ 
thesis  scheme  is  suggested  in  [7]  in  which  the  LQG  controller  can  be  made 
insensitive  to  a  class  of  parameter  variations.  It  is  seen  in  [5,6]  that  the 
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moment  coefficients  are  estimated  well  but  the  force  coefficents  are  not.  In 
particular,  their  estimation  response  is  quite  sluggish  due  to  the  effect  of 
high-frequency  noise  associated  with  the  model  of  the  clear  air  turbulence. 
An  adaptive  system  is  being  designed  so  that  the  controller  is  only  sensi¬ 
tive  to  the  moment  coefficients.  This  approach  to  autopilot  design  is  being 
considered  for  application  to  a  bank-to-turn  missile. 

Both  homing  missile  guidance  and  adaptive  control  schemes  are  cur¬ 
rently  designed  based  upon  the  certainty  equivalence  principle.  That  is, 
a  controller  and  estimator  are  placed  in  cascade  where  both  are  designed 
independently  of  one  another.  These  ad  hoc  controller  structures  are  not 
adequate  in  general,  and  improvements  are  sought  through  the  dual  con¬ 
trol  concept.  The  dual  controller  structure  which  has  never  been  realized 
by  even  the  simplest  stochastic  control  example  needs  much  study.  We 
began  our  efforts  by  noting  that  the  essence  of  the  dual  control  problem 
is  captured  in  deterministic  setting  where  the  nonlinear  observer  perfor¬ 
mance  is  enhanced  by  trajectory  modulation.  In  particular,  a  measure 
associated  with  the  Fisher  information  matrix  is  maximized  in  order  to 
obtain  an  information-enhanced  homing  path  [8,9,10],  In  [10]  not  only  is 
the  EKF  performance  improved  by  trajectory  modulation  over  the  propor¬ 
tional  navigation  path,  but  the  performance  of  the  MGEKF  along  these 
information-enhanced  paths  relative  to  that  of  the  EKF  is  impressive. 

Based  on  these  results  an  ad  hoc  guidance  rule  which  seems  to  pos¬ 
sess  the  dual  control  property  is  proposed.  It  is  seen  that  the  trace  of  the 
information  matrix  weighted  by  the  range-to-go  when  combined  with  the 
current  control  performance  index  reduces  to  a  quadratic  form.  This  form 
differs  from  current  forms  in  that  the  performance  index  due  to  the  infor¬ 
mation  measure  is  not  convex.  Some  preliminary  results  are  given  in  [11]. 
It  is  noted  that  this  simple  guidance  rule  produces  trajectories  similar  to 
those  generated  in  [8,9,10]. 

The  essential  difficulty  in  dealing  with  dual  control  problems  is  that  the 
sturcture  of  the  controller  is  not  well  understood.  For  this  reason,  ad  hoc 
schemes  pervade  the  literature,  but  no  rational  scheme  is  ever  suggested. 
For  this  reason,  we  have  begun  looking  into  asymptotic  approaches  to  this 
class  of  problems.  For  small  measurement  and  process  noise  variances,  the 
optimal  control  law,  obtained  from  the  Hamilton-Jacobi-Bellman  PDE  of 
a  particular  nonlinear  problem,  is  determined  in  terms  of  an  asymptotic 


expansion  in  the  state  estimate  and  state  error  variance.  This  problem  is 
chosen  because  the  estimation  process  is  conditionally  Gaussian  and  the 
deterministic  problem  (or  zeroth-order  solution  of  the  Hamilton-Jacobi- 
Bellman  equation)  is  integrable.  Since  it  is  hypothesized  that  dual  control 
problems  are  not  integrable,  the  expansion  about  the  zeroth-order  solution 
should  give  valuable  insight  into  the  structure  of  the  dual  control  problem. 
The  objective  is  to  apply  these  ideas  to  both  the  homing  guidance  and 
adaptive  autopilot  problems. 

There  is  a  real  need  to  determine  the  effects  of  guidance  system  errors  on 
missile  guidance.  To  do  this,  a  measure  of  performance  is  used  in  [12]  which 
is  associated  with  the  optimal  return  function  of  the  LQG  problem  and  has 
the  property  of  a  Lyapunov  function.  Since  the  guidance  laws  considered 
to  date  are  based  upon  the  certainty  equivalence  principle,  the  control  is 
a  function  of  the  filter  or  observer  output.  The  Lyapunov  function  is  a 
function  of  three  terms,  one  associated  with  the  LQ  problem,  one  associated 
with  the  observer,  and  one  associated  with  the  error  in  the  control  law  due 
to  the  inaccuracy  of  the  state  estimate  from  the  observer. 

Finally,  the  very  important  problem  of  target  maneuver  detection  is 
considered.  Our  approach  is  to  develop  target  motion  sensitive  filters  (ac¬ 
tually  observers).  The  theory  has  been  developed  for  time  invariant  linear 
dynamic  systems  [13,14,15].  The  objective  is  to  design  the  detection  gain 
so  that  the  target  motion  can  be  associated  directly  with  the  measurement 
residuals.  Our  present  effort  is  described  in  [15]. 
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A  recently-defined  information  index  is  used  to  enhance  the  information  content  of  minimum-control-efforl 
trajectories  lor  the  homing  missile  intercept  problem.  Optimal  planar  intercept  trajectories  are  obtained  for  a 
performance  index  which  is  control  effort  weighted  by  position  information  content.  The  missile  and  target  are 
assumed  to  be  operating  at  constant  speed.  The  shooting  method  is  u*ed  to  compute  the  optimal  paths:  but 
because  of  the  simplicity  of  the  model,  on-line  optimization  yielding  a  guidance  law  with  information  enhance¬ 
ment  should  be  possible. 


Nomenclature 

A  =cos6/vr 

a  =  missile  normal  acceleration  (ft/s2) 

B  =sin  <t>/uR 

c  =  constant  in  measurement  variance  model  (ft'2) 

G  =  augmented  end-point  function 

H  =  variational  Hamiltonian 

R  -  range  (ft) 

t  =time(s) 

vR  =  ratio  of  missile  velocity  to  target  velocity 
V  =  velocity 

W  =  weight 

X,  Y  =  planar  coordinates  (ft) 
a  =nondimensionai  missile  normal  acceleration 
8  =  missile  velocity  angle 

X  =  time-dependent  Lagrange  multiplier 

v  =constant  Lagrange  multiplier 
|,t)  =nondimensional  relative  coordinates 

p  =nondimensional  range 

r  =nondimensional  time 

<t>  =  missile  velocity  angle 

Superscripts 

( ' )  =  derivative  with  respect  to  t 

(  )'  =  derivative  with  respect  to  r 

Subscripts 

f  =  final  point 

M  =  missile 

R  =  relative 

T  = target 

0  =  initial  point 


Introduction 

N  Ref.  1,  the  problem  of  enhancing  the  information  con¬ 
tent  of  angle  measurements  in  a  homing  missile  engage¬ 
ment  is  considered.  While  the  dynamics  used  in  the  filter 
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development  are  linear  in  the  states  (relative  position, 
relative  velocity,  and  target  acceleration),  the  measurements 
are  nonlinear  in  a  rectangular  coordinate  frame.  Hence,  the 
trajectory  followed  by  the  missile  affects  the  measurement 
sequence  and,  in  turn,  the  ability  of  the  filter  to  extract  the 
states  from  the  measurements.  A  scalar  performance  index 
representing  a  measure  of  the  information  content  of  the 
missile  path  is  developed,  and  a  maximum-information  in¬ 
tercept  trajectory  is  determined.  Next,  measurements  are 
created  along  the  maximum-information  path  and  processed 
with  an  extended  Kalman  filter.  It  is  shown  that  the  filter 
performs  considerably  better  for  measurements  made  along 
the  maximum-information  path  than  it  does  for 
measurements  made  along  a  proportional-navigation  path. 
In  fact,  the  filter  diverges  from  the  true  state  along  the 
proportional-navigation  path  and  converges  to  the  true  state 
along  the  maximum-information  path. 

Since  the  trajectory  determined  from  the  scalar  information 
performance  index  reported  in  Ref.  1  induces  greatly  im¬ 
proved  state  estimation  results,  its  use  in  the  development  of 
an  information-enhancement  guidance  law  is  investigated. 
Because  of  the  complexity  of  the  problem,  the  simplest- 
possible  intercept  problem  is  formulated,  that  is,  two- 
dimensional  motion  of  a  constant  velocity  missile  and  target. 
The  performance  index  is  taken  to  be  the  control  effort 
weighted  by  the  information  index,  and  solutions  are  obtained 
by  the  shooting  method.  However,  to  obtain  initial  values  of 
the  Lagrange  multipliers  required  by  the  shooting  method,  the 
problem  of  minimizing  just  the  control  effort  must  be  con¬ 
sidered  first.  Then,  by  solving  the  weighted  problem  in  stages 
(gradually  increasing  the  weight  from  zero),  the  desired  op¬ 
timal  trajectories  can  be  obtained. 

Statement  of  the  Problem 

The  classical  guidance  law  known  as  proportional  naviga¬ 
tion  is  a  perturbation  guidance  law  about  a  nominal  intercept 
triangle.  The  intercept  triangle  is  essentially  a  minimum- 
control-effort  traiectory  in  :hc  plane  (see  Fig .  I  for 
nomenclature)  l  or  a  constant  velocity,  steerable  missile  and  a 
constant  velocity  target  movine  in  a  straight  line,  the  optimal 
control  problem  is  stated  as  follows: 

Find  the  missile  normal-acceleration  history  a(t)  which 
minimizes 
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subject  to  dynamical  constraints 

X  R  —  F  ^coso  F  ^jCOsO 
Yr  =  Prsin<5  —  Fv/sinfl 
8  =  a/V„ 

and  the  prescribed  boundary  conditions 


(2) 


t0-0,  XKg-R0, 

Y«o=0- 

Q5 

o 

III 

O 

n> 

ty-a  free,  XRf  =  0, 

il 

o 

Of  =  free 

(3) 

This  optimal  control  problem3  admits  the  solution  <t  =  Oor 
9  =  const.  However,  along  this  path,  the  filter  is  not  able  to 
estimate  all  of  the  states  because  the  range  along  the  line-of- 
sight  is  unobservable.  To  enhance  state  estimation,  it  is  possi¬ 
ble  to  weight  the  final  time  with  a  term  associated  with  infor¬ 
mation  content.  The  simplest  form  of  this  term  is  obtained  by 
considering  only  the  position-information  part  of  the 
performance  index  developed  in  Ref.  1.  With  this  term  includ¬ 
ed,  the  performance  index,  Eq.  (1),  can  be  rewritten  as 


J  =  - 


Wc'f  ,  f 

- —  \  a2dt—W\ 
?  J  'n  J 


dr 


to  1  +  c(X2R  +  Yr  ) 


(4) 


where  IV  is  the  weight  and  c  is  a  constant  associated  with  the 
measurement  variance  model  used  in  the  filter.  If  H'=0,  J  is 
the  control  effort;  and  if  IV =  1 ,  it  becomes  the  information  in¬ 
tegral.  Since  a  minimum  is  being  sought  and  since  the  infor¬ 
mation  is  to  be  maximized,  the  minus  sign  is  introduced  to 
convert  the  maximization  problem  to  a  minimization  problem. 
Finally,  when  actually  implemented,  it  is  envisioned  that  IV 
would  be  related  to  the  state  estimation  error  covariance,  in¬ 
creasing  as  the  covariance  increases. 

At  this  point,  the  following  nondimensiona!  variables  are 
introduced: 


Z=\fcXR,  v  —  VcYr,  t-'J cVMr. 
vR  =  VM/ Vr,  «  =  a/cVtf-  p  =  vc R 


In  terms  of  these  variables,  the  optima!  control  problem  is  to 
find  the  missile  normal-acceleration  history  a(r)  which 
minimizes  the  performance  index 


dr 


(5) 


j.lzZV'j  d,-»fV 

2  J  '0  Jr0  1  + 

subject  to  the  system  dynamics 

5 '  =  cos<i>/vR  -  cosO, 
tj'  =s\r\<t>/vR  -sind, 

O'  =a 

and  the  prescribed  boundary  conditions 


(6) 


Vo  -  0.  0o  a  free 


r/  =  free,  £/  =  0,  Vf  =  0>.  0/-  free 


(7) 

(8a) 

(8b) 


This  optimal  control  problem  does  not  yield  an  analytical 
solution  and  is  solved  with  the  numerical  optimization  method 
known  as  the  shooting  method.  because  of  the  sensitivity  ol 
the  shooting  method  to  initial  guesses,  the  problem  is  solved 
analytically  for  IV  0  to  obtain  l.agrange  multipliers.  Then, 
with  these  multipliers  as  initial  guesses,  the  shooting  method  is 
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converged  for  a  small  v  alue  of  IF.  W  is  increased,  and  the 
process  is  repeated  with  the  last  converged  multipliers  as  initial 
guesses. 


Minimum  Control-Effort  Problem 

For  the  case  where  W -0,  the  variational  Hamiltonian  and 
the  augmented  end-point  function  are  given  by 

H  =  a:/2  +  X;  (A  —  cos0)  +  X2  (S  —  sin0)  + 

G  —  I; f/*r  VjV/  (9) 

where  \,(i=  1,2,3)  is  a  time-varying  Lagrange  multiplier, 
v,(i  =  1.2)  is  a  constant  Lagrange  multiplier,  A  =  coso/iv 
and  fi  =  sino/t>fl.  The  Euler-Lagrange  equations2  for  ,\  lead  to 


X,'=  -Ht-0  (10a) 

-  -  Hn  =0  (10b) 

\j-  —H,)=  -\jS\nd  +  \2cos9  (10c) 

where  the  optimal  control  satisfies  the  optimality  condition 
Ha=a  +  \}  =  0  (11) 


Finally,  the  natural  bouncarv  conditions  are 

Kf  =  G(/  =  », 

^!/=Gv=V2 

=  G„f  =  0,  Xj0  =  G)q  =  0 

Hj  =  atf/2  +  X;^  (A  ~cos9f) 

+  \}/(B-sm9,)  +  \3j.af  =  0  (12) 

It  is  observed  that  the  absolute  minimum  control  effort  is 
a  diieved  when  «  =  0.  Whether  or  not  this  can  be  the  solution  is 
r  w  investigated.  Equations  (10a)  and  (10b)  indicate  that  X; 
a  d  X_,  are  constants  so  that  Eq.  (10c)  gives  6 --const.  Hence, 
t  e  system  equations,  Eq.  (7),  can  be  integrated  subject  to  the 
final  conditions  of  Eq.  (8b)  to  obtain 

0- (A -cos0)ry  +  0  =  5  — sinS  (13) 

which  determines  0  and  as  follows: 

sin0  =  5,  tj  =  {0/(cosf?  — A )  (14) 


Y 


i  .. 


X 

Fift.  1  1  wo-dimensional  intercept  neometr). 
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Next,  Eq.  (11)  gives  X,  =0  which  satisfies  the  natural  bound- 

ary  conditions  of  Eq.  (12).  Finally,  Eqs.  (10)  and  (12)  lead  to  "/"■TtJ'+ijJ  +  '  y  '*  ~ cos('/)  +  k.y(B-sinO)  -  0  (19) 


X,  =  0.  \,  =  0  (15) 

These  values  of  X  will  be  used  to  begin  the  solution  of  the 
information-weighted  minimum  cor.troi-effort  problem. 

Minimum  Information-Weighted 
Control-Effort  Problem 

For  IV&O,  the  variational  Hamiltonian  and  the  augmented 
endpoint  functions  are  defined  as 

1-W  ,  W 

H- — — -a2 - r - -  +  X,  (A  -  cosfl)  +  X,  (B  -sinti)  +  X_,cr 

2  /  +  f*+r 


Unfortunately,  this  optimal  control  problem  does  not  yield 
an  analytical  result  so  that  numerical  methods  must  be 
employed.  Here,  the  shooting  method4  is  used  to  solve  the  cor¬ 
responding  two-point  bounda.y-value  problem  (TPIJVPL  It  is 
formed  by  solving  Eq.  (18)  for  the  control  and  eliminating  or 
from  the  remaining  equations  to  obtain  the  differential  system 

? '  =  .l  —  cost) 

i)'  ~  B  —  sint? 

r  -  -Xj /(/-  IV) 

X/=  -2 lf£/(f +  i'  +  r  )* 


G  =  v,$/+v,i)/ 

Next,  the  differential  equations  for  the  X’s  are  given  by 

\;=-2wv/u+z2+v2)2 

Xj  = -Xysinfl  +  X^cos  6  (17) 

while  the  opo.nal  control  must  satisfy 

(l-W)a  +  \3  =  0  (18) 


Xj  =  —  2lVrj/  (I  +  +  r/2)2 

Xj  = -XyCose  +  X,sin0  (20) 


and  the  boundary  conditions 

r0  =  0,  £o=Pt>.  Vo  =  °- 

t/  =  0,  v/  =  0, 

IV 


\,ro 


f  +  $/  +  Vf 


y  +  A  A  —  cos  dr)  +  X_, p  ( B  —  sinfly )  =  0  (21) 


Finally,  the  natural  boundary  conditions  lead  to 
^ lf~vl >  ^2(~v2>  XJo=0 


The  TPBVP  is  solved  by  using  the  initial  Lagrange 
multipliers  for  IE- 0  and  a  small  value  of  IF'.Then.as  IFis  in¬ 
creased,  the  initial  guess  for  the  X’s  is  the  converged  set  for  the 


Table  1  Summary  of  nuincriial  results* 


W 

On.  deg 

s 

Information 

content 

Control 

effort 

</■  ' 

0  00 

22,6199 

0.(K) 

0  f>» 

5.2612 

0.0000 

9  2(>7‘l 

0.01 

17.5956 

-  .002 <9 

-0.00130 

5  6171 

(025 

9..1577 

0.10 

9  RI0O 

-  .02.109 

-  .01244 

5.77s: 

.0102 

9  6648 

0  21 

-  .8789 

■  .04570 

.01142 

.0268 

10  44S6 

(■>,.  -  4.5  f 30fMl  It),  Vp  -  I  o  ,’1  thy.  Tf  =  i*fc.  free 
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previous  IF',  This  procedure  leads  to  the  results  presented  in 
Table  1  and  Figs.  2  and  3  for  the  case  where  the  initial  range  is 
3000  ft,  the  velocity  ratio  is  vR  =  UM/l/7-=  1.3,  and  the  target 
direction  is  0  =  30  deg. 

It  is  noted  from  Table  1  that  as  W  increases  the  control  ef¬ 
fort,  the  information  content  and  the  final  time  increase.  The 
corresponding  trajectories  are  shown  in  Fig.  2.  For  increasing 
W,  the  trajectories  tend  to  move  more  toward  a  tail  chase  and 
oscillate  back  and  forth  behind  the  target.  Also,  the  normal 
acceleration  required  to  perform  the  maneuver,  presented  in 
Fig.  3,  increases  with  W.  For  ^=0.21,  the  highest  normal  ac¬ 
celeration  required  is  approximately  6  g. 

The  trajectories  of  Fig.  2  are  similar  to  those  obtained  in 
Ref.  1  where  the  performance  index  is  just  information.  For 
comparison  purposes,  the  horizontal  projection  of  the  max¬ 
imum  information  path  of  Ref.  1  is  illustrated  in  Fig.  4.  Note 
the  similarity  with  W  =  .21  of  Fig.  2. 


Finally,  solutions  have  only  been  obtained  for  values  of  H' 
up  to  around  0.21.  For  M/>0.21,  the  shooting  method  is 
unable  to  converge  to  a  solution.  It  is  felt  that  the  difficulty  is 
caused  by  the  minus  sign  in  the  performance  index  of  Eq.  (6). 
At  some  point,  the  missile  can  accumulate  information  faster 
than  spending  control  to  accomplish  the  intercept.  Hence,  the 
missile  can  wander  around,  accomplish  the  intercept  at  =  oo, 
and  generate  J  =  -  oo . 

Discussion  and  Conclusions 

A  recently-defined  information  index  has  been  used  to 
enhance  the  information  content  of  minimum  control-effort 
trajectories  for  the  homing  missile  intercept  problem.  Optimal 
information-weighted  trajectories  have  been  obtained  and 
display  the  desired  characteristic,  that  is,  maneuvering  for  the 
sake  of  increasing  information  content.  Because  of  the 
simplicity  of  the  model  assumed  here,  it  should  be  possible  to 
compute  these  optimal  trajectories  on  line  and.  hence,  have  a 
mechanizable  guidance  law  for  information  enhancement. 
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A  Stochastic  Analysis  of  a  Modified  Gain  Extended 
Kalman  Filter  with  Applications  to  Estimation  with 
Bearings  Only  Measurements 
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Abstract— A  new  globally  convergent  nonlinear  observer,  called  the 
modified  gain  extended  Kalman  observer  (MGEKO),  is  developed  for  a 
special  class  of  systems.  This  observer  structure  forms  the  basis  of  a  new 
stochastic  filter  mechanization  called  the  modified  gain  extended  Kalman 
filter  (MGEKF).  A  sufficient  condition  for  the  estimation  errors  of  the 
MGEKF  to  be  exponentially  bounded  in  the  mean  square  is  obtained. 
Finally,  the  MGEKO  and  the  MGEKF  are  applied  to  the  three- 
dimensional  bearings-only  measurement  problem  where  the  extended 
Kalman  filler  often  shows  erratic  behavior. 

I.  Introduction 

THE  construction  of  implementable  observers  for  nonlinear 
deterministic  systems  and  nonlinear  filters  for  nonlinear 
stochastic  systems  remains  a  challenge.  With  a  few  notable 
exceptions  (e.g.,  [15]),  implementation  is  based  upon  ad  hoc 
expansions  and  linearization  techniques.  For  example,  in  [27]  an 
extended  linear  observer  is  developed.  In  [26|  a  nonlinear 
observer  is  implemented  by  augmenting  the  original  state  space  by 
new  states  composed  of  the  quadratic  terms  resulting  from  the 
second-order  Taylor  expansion  of  the  system  nonlinearities.  In 
[24|  the  terms  of  a  truncated  expansion  of  linearly  independent 
functions,  which  approximate  the  system  nonlinearity  is  used  to 
augment  the  state  space  and,  thereby,  construct  an  observer. 
Similarly,  in  stochastic  nonlinear  estimation  problems,  numerous 
filtering  algorithms,  based  upon  series  expansions  to  realize 
approximately  the  conditional  mean,  have  been  suggested  (see  the 
bibliography  of  [81).  However,  such  techniques  are  prohibitive,  in 
general,  even  for  low -order  dynamical  systems  because  of  the 
computational  burden.  Moreover,  stability  analyses  of  such 
schemes  are  quite  rare. 

As  a  computationally  realizable  and  practical  filter,  the  EKF  is 
often  used.  Fortunately,  there  are  many  examples,  especially  in 
high  SNR  problems,  where  the  EKF  is  successful  in  producing 
useful  estimates.  Except  for  a  few  particular  cases,  little  is  known 
about  the  properties  of  the  estimates  (i.e..  stability,  unbiasedness, 
and  convergence)  that  it  or  its  variants  produce  (5|,  [13].  To  begin 
to  understand  some  of  the  properties  of  the  structure  of  the  EKF,  a 
nonlinear  filter  with  a  constant  gain  is  proposed  in  [25]  and  also 
forms  the  basis  for  the  stochastic  stability  analysis  of  [20] .  The 
filter  designed  in  [ 25 ]  is  based  solely  on  stability  considerations  in 
a  probabilistic  Hilbert  space.  Later,  in  [  19]  and  ( 20]  the  stochastic 
stability  properties  of  the  constant  gain  EKF  (CGEKF)  is 
determined  in  the  extended  inner  product  space  M,r.  In  this  way  a 
certain  margin  of  robustness  is  guaranteed  by  calculating  the  gain 
of  the  CGEKF  from  the  algebraic  Riccati  equation  (ARE) 
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associated  with  a  steady-state  Kalman  filter  based  on  a  linearized 
model  of  the  actual  system.  However,  the  convergence  rate  ot  the 
CGEKF  is  found  to  he  too  slow  for  use  in  mans  real  time 
estimation  problems.  To  enhance  convergence  a  gain-scheduling 
scheme  is  suggested  [ 20] .  but  the  stability  analysis  no  longer 
applies. 

The  effort  described  here  is  restricted  to  a  special  class  of 
nonlinear  functions  which  allows  the  stability  analysis  of  [25 1  to 
be  applied  to  an  estimator  where  the  gain  changes  according  to  an 
update  formula  that  is  similar  to  that  of  the  EKF  This  special  class 
of  nonlinear  functions  was  motivated  by  the  class  of  functions 
which  can  be  manipulated  into  a  pseudomeasurement  form  |l). 
[3],  [14],  [  1 8] .  (29|.  For  deterministic  systems  these  pseudo¬ 
measurements  are  linear  functions  of  the  states  of  the  system, 
although  the  coefficient  matrix  is  a  nonlinear  function  of  the 
original  measurements.  By  using  the  pseudomeasurements  in  a 
linear  observer  structure  |2I],  global  stability  can  be  shown. 
However,  if  the  pseudomeasurement  observer  (PMO)  is  used  in  a 
noisy  environment  as  a  pseudomeasurement  filter  tPMF)  |1|.  [3 1 . 
[14],  [18],  biased  estimates  result.  This  property  of  the  PMF  is 
also  shown  in  the  results  of  Section  IV  In  |21)  it  is  shown  for  a 
particular  example  how  the  EKO  (the  extended  Kalman  observer) 
can  be  manipulated  into  the  form  of  the  PMO.  The  essential 
difference  lies  only  in  the  calculation  of  the  observer  gains  A 
modification  of  the  gams  of  the  PMO  is  suggested  in  [21]  which 
enables  the  EKO  to  achieve  performance  similar  to  that  of  the 
PMO.  This  is  called  the  modified  gain  EKO  (MGEKO).  The 
essential  idea  behind  the  MGEKO  is  that  the  nonlinearities  be 
“modifiable.”  This  notion  is  defined  in  Section  II  and  is  the 
central  idea  used  in  developing  the  structure  of  the  estimators. 
This  idea  has  some  similarities  with  the  development  of  the 
pseudomeasurement  function  but  it  is  not  the  same.  For  example, 
the  concept  of  modifiability  also  applies  to  nonlinear  dynamic 
systems  [22],  [23]. 

The  objective  of  this  paper  is  threefold.  First,  the  class  of 
nonlinear  modifiable  functions  is  defined  in  Section  II,  and  then  a 
general  form  of  the  MGEKO  is  shown  to  be  globally  convergent 
under  certain  conditions.  Secondly,  the  stochastic  stability  of  the 
MGEKO  used  in  the  noisy  environment  as  a  modified  gain  EKF 
(MGEKF)  is  analyzed  in  Section  III.  Since  (he  gains  are  not 
constant,  this  analysis  is  related  but  distinctly  different  from  that 
of  |25],  1 19],  |20|.  Note  also  that  since  the  MGEKF  is  based  on 
the  algorithm  for  the  EKF,  the  gain  of  the  MGEKF  is  a  function  of 
only  past  measurements.  This  differs  from  the  gain  of  the  PMF 
which  is  a  function  of  present  and  past  measurements.  Therefore, 
by  eliminating  the  direct  correlation  ot  the  gain  and  measurement 
noise  process  in  the  estimates  of  the  MGEKF.  the  estimation  bias, 
so  prevalent  in  the  PMF.  is  seen  by  the  simulation  of  Section  IV  to 
be  effectively  eliminated.  Therefore,  the  third  objective  is  to  apply 
the  MGEKF  to  the  bearings  only  measurement  problem  (BOMP) 
(Section  IV)  which  has  important  applications  in  naval  engage¬ 
ments  [I  |,  [3],  [14],  [18]  or  for  homing  missile  engagements 
where  passive  seekers  are  used  to  track  the  target.  Comparisons  of 
the  estimation  performance  of  the  MGEKF  with  the  EKF  and 
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II.  The  Modified  Gain  Extended  Kai  man  Observer 
(MGEKO) 

In  this  section,  a  class  of  nonlinear  functions  is  defined  which 
forms  the  basts  of  the  globally  convergent  modified  gain  extended 
Kalman  observer  (MGEKO).  The  gain  structure  of  the  MGEKO 
for  this  particular  class  of  nonlinear  system  is  essentially  the  same 
as  that  of  the  Kalman  filter,  since  the  error  dynamics  of  the 
MGEKO  are  in  the  same  form  as  those  of  a  linear  system.  Even 
though  this  paper  deals  with  discrete,  linear  system  dynamics,  the 
ideas  extend  to  the  continuous-time  case  and  to  discrete  nonlinear 
system  dynamics  (22|.  [231. 

Consider  the  deterministic  case  where  the  system  dynamics  are 
linear,  and  the  measurement  c*  is  a  nonlinear  function  of  the 
states  x,.  i.e.. 


x,,  i  =  A,x, 


■h. \x,) 


a,(x)  -  a,(x)  =  g,(z’ ,  x)(x  -  X) 


where  z*  =  h,(x). 

Note  that  the  difference  a, (a)  -  a,(x)  is  equal  to  g,(z*,  x)(x  - 
x)  without  any  approximations.  Notice  also  that  g,(Z,*,  x)  = 
g,(h,(x),  x)  *  g,(h,(x),  x),  where  this  latter  quantity  is  (if  a,  is 
differentiable)  the  differential  of  a,  evaluated  at  x,  as  used  in 
linearization. 

The  MGEKO  has  the  following  structure. 


X,n=A,X„ 
x,  =x,  +  k,(zf  -  h,(x,)). 


where  x,  can  be  interpreted  as  a  propagated  estimate  at  time  /',  x, 
can  be  interpreted  as  an  updated  estimate,  and  k,  is  some  gain 
sequence  that  may  depend  on  past  and  present  data.  If  h,(x )  is 
assumed  modifiable,  x,  in  (2.5)  can  be  rewritten  as 


x,  =  x,  +  k,g,{z*,  x,)(x,-x,) 


(2.6) 


where  g,(Z*.  x,)(x,  -  Jf,)  =  h,(x,)  -  h,(x,),  and  g,  €  SI**"-  The 
error  e,  in  the  estimates  is  defined  as 


(2.7) 


e,ax,-xt 

=  (I-k,g,{z*,  x,W,-D,e, 
where  from  (2  1)  and  (2.4),  i,  satisfies 
i,kx  ,-x,  = 


(2.8) 


Since  Z ,*  is  a  deterministic  quantity,  (2.7)  and  (2.8)  are  exact 
and  they  are  in  the  same  form  as  that  of  a  linear  estimation  system. 
This  provides  us  with  motivation  for  choosing  a  particular  gam 
sequence  based  upon  Kalman  filter-type  updates.  Specifically, 


'.‘A.-iP.-tAlt+Q, 


(2.9) 


k,  -  m,g,(z*.  X,)T(gfz%  i,)m,g,(z:,  Jf,)r+y,)  1  (2.10) 


■)4  I 

Note  that  if  we  had  the  linear  estimation  problem  wuh  dynamics 
f,.i=/V  il  +  ui,  (2.12) 

and  measurements 


y,  =  g\z*.  x,)l  +  v, 


(2.13) 


(2.1) 

(2.2) 


where  /  G  Z.  (the  nonnegative  integers),  x,  G  It",  z*  G  Stq. 
Consider  the  following  definition  of  "modifiability." 
Definition  1:  A  time-vary  ing  function  a,:U"  -»  Uq  is  modifi¬ 
able  if  there  exists  a  q  x  n  time-varying  matrix  of  functions  g,:Slq 
x  jl"  —  so  that  for  any  x ,  x  G  W  and  iGZ,, 


(2.3) 


(2.4) 

(2.5) 


where  the  white  noise  sequences  ta,  and  u,  have  covariances  Q,  and 
y,,  respectively,  then  (2.9)-(2  1 1)  would  be  the  precise  equations 
for  the  covariances  m,  and  p,  of  the  one-step  predicted  and  filtered 
estimates  of  f,.  respectively.  Here  we  are  assuming  that  z*  and  x, 
are  known.  We  w  ill  view  Q ,  and  y,  as  design  parameters  and  w  ill 
call  m,  and  p,  the  “pseudocovariances"  of  e,  and  respectively. 

If  (A,,  gf  is  uniformly  observable  and  (A,,  Q]  :)  is  uniformly 
controllable,  it  can  be  shown  that  the  error  dynamics  of  (2.7)  and 
(2.8)  arc  globally  convergent  to  zero  by  using  the  Lyapunov 
function  V,(e,)  =  e]p t  'e  in  a  way  that  is  similar  to  the  approach 
used  by  [17).  Note  that  since  g,  depends  upon  the  specific  state- 
space  trajectory,  the  observability  condition  is  also  trajectory 
dependent  in  genera)  and  this  may  be  difficult  to  check. 

Remark  1:  In  the  next  section  white  measurement  and  process 
noise  sequences  are  added  to  the  measurement  (2.2)  and  dynamics 
(2.1).  If  the  gain  algorithm  of  (2.10)  is  used  in  this  stochastic 
environment,  biased  estimates  are  expected,  since  the  gain  and  the 
residual  of  (2.5)  are  directly  correlated  in  a  manner  similar  to  the 
pseudomeasurement  filter  (PMF)  as  shown  in  the  Appendix. 
Therefore,  a  gain  algorithm  similar  to  that  of  the  EKF  which 
ensures  that  the  gain  is  a  function  of  the  past  measurements  only  is 
recommended.  However,  if  the  measurement  equation  is  a 
nonlinear  function  of  the  states,  the  useful  relationship  between 
the  observability  Gramian  and  the  p,  matrix  in  (2.11)  is  no  longer 
available.  This  discussion  motivates  the  assumptions  required  in 
the  development  of  the  modified  gain  extended  Kalman  (liter 
(MGEKF)  presented  in  Section  III. 

III.  The  Modified  Gain  Extended  Kalman  Filter  (MGEKF) 

In  this  section  we  develop  the  MGEKF  and  study  its  stochastic 
stability.  As  slated  in  Remark  1,  the  gain  algorithm  of  the 
MGEKF  is  altered  from  that  of  the  MGEKO  in  order  to  reduce  the 
biases  due  to  direct  correlations  between  the  gain  and  the  residual. 

Consider  the  stochastic  case  where  the  system  dynamics  arc 
linear,  and  the  measurement  z,  is  a  nonlinear  function  of  the  states 
x„  i.e.. 


Xj  =  Ai-tXi-\  +<*>,.  i 
Z,  =  h,(x,)  +  u,&Z*  +  v, 


(3.1) 

(3.2) 


where  {u>,}  is  a  zero-mean  independent  process  noise  vector 
sequence  with  finite  second  moment 


£{<*>, u,r}£?A; 


(3.3) 


where  =  I  if  /  =  j  and  6„  =  0  if  /  *  j.  and  where  {ty}  is  a 
zero-mean  independent  measurement  noise  vector  sequence  with 
finite  second  moment 


E{vv;)=yb,t 


(3.4) 


Furthermore,  the  u>/s  and  v/s  are  assumed  independent. 

Based  on  Remark  1 ,  the  structure  of  the  MGEKF  is  altered 
from  that  of  the  MGEKO.  Furthermore,  since  z*  in  (3.2)  is  not 
available  because  of  the  measurement  noise  u,,  Z*  is  replaced  by  z, 
in  the  gain  formulation.  If  /»,(•)  is  modifiable  and  differentiable, 
the  estimates  of  the  MGEKF  are  obtained  from  the  fol'owmg 
algorithm: 


/>,  =  (/- k,g,(z*.  xfimfl-k'g'izf.  *t))T  +  k, ykE  (2.11) 


x,  =  A,.  i 
S,  =  X,  +  k,(z,  -  h,(x,)) 


(3.5) 

(3.6) 
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=  (3-7> 

k,  =  mfi  f,  +  t.)  ' 1  (3  .S') 

P,  =  U-^,(2,.  Jr,))™,  (/-*,£,(’,■  X))T  +  k^yk*  (3.9) 
where  Ar,  4  dA,(.c,)/d*,|l(tli. 

Note  that  if  the  predicted  measurement  h,(x,)  is  used  in  (2. 10) 
rather  than  z then  the  gain  in  (2. 10)  reduces  to  that  of  (3.8).  In 
this  way,  the  gain  of  the  MGEKF  is  in  the  form  of  the  EKF  which 
ensures  that  the  gain  is  only  a  function  of  the  past  measurements. 
Since  the  stochastic  estimator  (3.6)  can  be  rewritten  in  a 
modifiable  form  without  any  approximation  as 

X,  =  X,  +  c,(x,-X,)  +  k,v,  (3.10) 

where  c,  4  k,g,(z *,  X,),  the  error  equations  produced  from  (3.10). 
(3.1),  and  (3.5)  are  exact.  Although  c,  is  not  implementable.  the 
form  of  (3. 10)  is  important  for  our  analysis  of  the  behavior  of  the 
algorithm.  Furthermore,  it  is  critical  to  MGEKF  performance  that 
g,  in  (3.9)  be  calculated  using  z,.  In  contrast,  the  EKF  algorithm 
[13]  calculates  g,  using  h,(x,)  in  (3.9).  As  a  final  note,  in  [28]  the 
MGEKF  is  applied  to  an  estimation  problem  where  part  of  the 
state  vector  is  composed  of  discrete  valued  random  variables.  For 
this  problem  g,(z,,  X,)  exists.  The  EKF  cannot  be  applied  to  this 
problem  since  the  partial  derivative  hti  does  not  exist. 

A.  Stochastic  Stability  of  the  Intermediate  MGEKF 

In  order  to  facilitate  the  stability  analysis  of  the  MGEKF,  as  a 
first  step  we  employ  an  unrealizable  estimator,  which  used  zT  to 
calculate  the  gain  k,.  Although  this  scheme  is  not  implementable 
for  the  noisy  environment,  it  forms  a  nominal  to  which  the 
implementable  filter  is  compared.  This  estimator,  called  "the 
intermediate  MGEKF"  for  convenience,  is  given  by  the  following 
algorithm: 

X*  =  A,,txf,  (3.11) 


The  errors  in  the  estimates  of  the  intermediate  MGEKF  can  be 


written  as 

e*  =  x,  -  X’  =  A,  if*  ,  +  u),  i 

(3.17) 

where 

e*  =  x,-x  }  =  (l-c*)t*-k}v,. 

(3.18) 

Define  a 

Lyapunov  function  l',(e,*)  as 

K(e*)  =  ‘ef 

(3.19) 

Before 

stating  Theorem  1,  the  following 

assumptions  are 

needed. 

Assumption  I;  A,  in  (3. 1)  is  uniformly  bounded  and  invertible. 

Assumption  2:  L*  =  (/  -  c*)  of  (3.15)  is  invertible  for  all 

i  e  z. 

Assumption  3:  Q,  of  (3.3)  is  uniformly  bounded  from  below 
such  that  Q,  >  a  I  >  0  for  all  /  E  Z. . 

Assumption  4:  m,*~ '  is  bounded  from  below  by  a  constant 
matrix  c  l  such  that 

II  £",(<?*) |!  =  \\e*rm*  -  'e*W  >  c||f*  II J.  (3.20) 

Remark  2:  Assumptions  1  and  3  are  not  terribly  restrictive. 
Note  that  for  the  MGEKO  described  in  Section  II,  the  uniform 
observability  of  (A,,  g ,)  is  sufficient  to  guarantee  that  D,,  defined 
in  (2.7).  is  invertible  and  that  m,~'  is  uniformly  bounded  from 
below  where  m,  is  defined  in  (2.9).  The  corresponding  conditions 
on  L*  and  m*~'  in  Assumptions  2  and  4  are  not  unreasonable, 
although  there  is  no  such  simple  sufficient  condition  that  can  be 
checked. 

Theorem  I:  The  errors  in  the  estimates  of  (3.17)  and  (3.18)  of 
the  intermediate  MGEKF  are  exponentially  bounded  in  mean 
square  with  exponent  6  under  Assumptions  1-4. 

The  following  proof  of  Theorem  1  is  distinctly  different  from 
the  proof  of  [25.  Theorem  4]  where  the  stability  of  the  estimates  is 
based  on  a  constant  gain  nonlinear  estimator. 

Proof:  Rewrite  (3.17)  by  using  (3.18)  as 

t=A,L*i* -A.ky.  +  u,.  (3.21) 


~h,(x}))=x?  +  k;g,(zf 

x;hx,- 

-i  ,*)  +  *>, 

4  i*  +  c*(x, 

-x*)  +  ky, 

(3.12) 

.M/li  +  Q, 

- 1 

(3.13) 

* 

"  * 

11 

m*(fnyi(ftl)m*(hl)T 

+  T,]"1 

(3.14) 

P*  = 

(/-c;)m*(/-c;)r+*; 

*y.k*T 

(3.15) 

Note  that  v,  and  and  u>,  and  L*  are  independent,  since  z*  is 
not  a  function  of  w,  and  X*  is  a  function  of  the  past  measurements. 
Moreover,  since  m,*  ,  in  (3. 13)  is  a  function  of  z*  and  X*.  mf ,  is 
independent  of  i\  and  w,.  Take  the  conditional  expectation  over 
♦,(<?*. i)  -  K, (<?,*).  given  Y*  =  {e0*.  ■.<?,*} 

=  TEyl{L*rA >; ;  | AjL*  -  m*  -  '}#,* 


where  the  superscript  *  means  that  the  estimates  are  obtained  from 
the  gain  algorithm  with  z ,*  =  h,(x,)  in  (3.2)  instead  of  z,.  The 
essential  change  in  forming  the  intermediate  MEGKF  is  that  z* 
rather  than  z,  is  used  to  calculate  p,  and,  therefore,  (3.9)  is 
replaced  by  (3.15).  The  second  equality  of  (3,12)  uses  the 
important  modifiability  characteristics  of  h,  to  be  exploited  in  the 
following  analysis. 

First,  we  consider  the  stability  of  the  intermediate  MGEKF  by 
using  Lyapunov's  second  method  in  the  probabilistic  Hilbert 
space  Lj.  The  norm  of  a  vector  random  variable  x,  is  defined  as 


xjx,r(x,)  dx„ 


(3.16) 


where  x(x,)  is  the  probability  density  function  of  Before 
proceeding  further,  the  following  definition  is  introduced. 

Definition  2 125}:  A  discrete  stochastic  process  x,  is  said  to  be 
exponentially  bounded  in  mean  square  with  exponent  6,  if  there 
exist  constant  0  <  S  <  1,  K,  a  0,  and  K2  >  0  such  that 

||jr,||2sAf,  +/f2(I  -5)'  for  all  <GZ.. 


+  £7,{tr  [ k* tA ,rm * ^  j/4 ,Ar *y, ]  +  tr  [m,* ; J<?,)}  (3.22) 

where  £■?{•}  is  a  conditional  expectation  operator  given  Y*  Note 
that  the  terms  inside  the  tr  operator  become  strictly  positive  for  Q, 
>  a •/  >  0  and  if  m*  obeys  Assumption  4.  Define 

Af,,  =  £T>  [k*r/l ,rm* i \A,k}y,]  +  tr  [m*;jQ,l},  (3.23) 

then  (3.22)  becomes 

ET.fK.der.,)- V, {<*})} 

=  A'|(  +  e*rEf,[L*  tA  (rm*  fxA,L}-my  1  }e,* .  (3.24) 

The  term  inside  the  £*  operator  on  the  right-hand  side  of  (3.24) 
can  be  written  after  some  manipulations  as 

L}TA  Jm} ;  \A,L}  -  m* " 1 

=  L* TA  ,rM ,p •  A  *  +  Q, ) -  'A , L  •  -  m?  - 1 

=  -s*r(s*m*s*r+  y,)  's* 

-L}Tpy'A,  '(Qf  +  A,  Tpf'A;')  1 A;Tpf'L ,*  (3.25) 


*  IL*  S  \ 
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where  the  matrix,  inversion  lemma  |13|  is  used.  .4,  is  assumed 
invertible,  and  s’  of  the  intermediate  MGEKF  satisfies 

5*  =  y,k*rL*  Tm*  (3.26) 

Therefore,  from  (3.25) 

e*T(L‘ TA  Tm* ; \A,L*  -  m*  ~  ')e* < 0.  (3.27) 

Hence,  there  exists  0  <  p,  <  3i  <  1  such  that 

e* TL  * rA  Ttm *  ^  \A,L*e*  =  p,e*Tm*  ~'e*.  (3.28) 

The  existence  of  3i  is  assured  by  Assumptions  2,  3,  and  (3.25). 
Now.  (3.24)  becomes 

£*  {  K.  ,(<?*  i)  -  y,(eT))  sA',  ?,(**)}  (3.29) 

where  0  <  supy€z.  {Aj;}  £  Aj  <  oo,and6,  =  1  -  p,  such  that 
0  <  (1;  <  S,  <  1.  The  boundedness  of  Kv  is  obtained  from 
Assumptions  3.  4,  and  (3. 14).  Note  that  Assumption  3  implies  that 
m *  “ 1  is  uniformly  bounded  from  above. 

By  applying  the  nesting  property  of  the  conditional  expectation 
to  (3.29i.  one  can  obtain 

E%{  K .  ,(er. ,)}  =  £>0{£*v,(  K - 1(* ,)} } 

sawi -«,)£?0{£T,{  f, (<?,*>}} 

=  A, +(l-6,)£f0{  ?,(<?;)}.  (3.30) 

Define  o  as  5  =  inf/€Zi  {5, }  and  note  that  since  0  <  02  £  5,  < 
1,  then  o  >  0.  Applying  (3.30)  recursively  results  in 

£I0{  V. -  .(C  ,)J  s  £.  £  V  -  6V  +  < 1  -  b)‘ *  ‘£V  ^o*)} ■ 

/=0 

(3.31) 

Use  (3.19)  and  take  an  unconditional  expectation  over  Y*  in 
(3.31).  Then 

||e7J|2<*,  +  A':(l-6)'’'  (3.32) 

where  AT,  <  A',  S,”  0  (1  -  5)'/c  =  Aj/c<5  and  Kt  =  E{  yo(ef)\/c. 

Therefore,  the  exponential  boundedness  of  the  intermediate 
MGEKF  is  proved. 

B.  Stochastic  Stability  of  the  MGEKF 

Thus  far,  the  exponential  boundedness  only  of  the  intermediate 
MGEKF  has  been  proved.  Now.  our  objective  is  to  obtain 
sufficient  conditions  for  the  MGEKF  to  be  exponentially  bounded 
in  Li  by  comparing  the  estimates  of  the  MGEKF  to  those  of  the 
exponentially  bounded  intermediate  MGEKF.  Such  conditions  are 
found  again  by  using  Lyapunov’s  second  method  and  are  similar 
in  concept  to  [6J.  In  this  way  the  conditions  for  the  deviation  from 
the  nominal  to  belong  to  the  set  of  nondestabilizing  deviations 
(terminology  excerpted  from  [19])  are  obtained. 

The  errors  in  the  estimates  of  the  MGEKF  can  be  written  from 
(3.1),  (3.5),  and  (3. 10)  as 

i,.\  =x,.| =A,e,  +  u>,  (3.33) 


e,  =  x,-x,  =  (l-c,)e,-k,  i’,.  (3.34) 

Note  that  the  only  difference  between  (3.33)  and  (3.34)  for  the 
MGEKF.  and  (3.17)  and  (3.18)  of  the  intermediate  MGEKF, 
results  from  the  algorithm  for  the  calculation  of  the  gain  of  each 
filter.  That  is,  the  MGEKF  uses  z,  instead  of  z *  in  the  gain 


algorithm  Therefore.  .V,  ,  =  {e,,  i>;,  e,  ,[  contributes  to 
the  difference  in  the  gams  k,  and  k*  since  .V  ,  affects  the 
calculation  of  (3.9).  Denote  the  gain  of  the  MGEKF  as  the  sum  ot 
the  gam  of  the  intermediate  MGEKF  and  the  perturbed  gain  due  to 
\  , 

k,  ~  k*  +  AA,.  (3.35) 

Similarly,  c,  of  the  MGEKF  is  denoted  as 

c,  =  c*  +  Ac, .  (3.36) 

Therefore,  e, . ,  in  (3.33)  can  be  written  as 

e,.,=A,{L*  -Ac,  >«?,-. ■!,(**  +  A*,)e,  +  u>,.  (3.37) 

Consider  the  following  sufficiency  theorem. 

Theorem  2:  If  Ak,  in  (3.35)  is  bounded  and  Ac,  in  (3.36) 
belongs  to  the  set  of  nondestabilizing  deviations  such  that 

(L*  -  A c,)TA  [m*  _ \A,(L*  -  Ac,)  -  m*  - 1  < 0  (3.38) 

for  all  /,  then  under  Assumptions  1-4,  the  error  in  the  estimates  of 
the  MGEKF  is  exponentially  bounded  in  mean  square  with 
exponent  6. 

Proof:  Introduce  the  Lyapunov  function  for  the  MGEKF  as 

P',,1  (*,.  ,)  =  */!  ,/»;;!*,♦,  (3.39) 

where  m?2  j  is  bounded  from  below  by  Assumption  4.  Since  L*  - 
Ac,  in  (3.37)  is  a  function  of  z* .  x,,  and  yV,.|,  L*  -  Ac,  is 
independent  of  u,  and  a),.  Therefore,  the  conditional  expectation  of 
P',.i(c,.i)  -  K(e,)  forgiven  Y,  =  {<?0.  <?i,  •  <?,}  becomes 

[similar  to  (3.22)] 

£h,(^i(<?,.i>- 

=  ijEY,{(L*  -  Ac,)  TA  Jm* ;  ]4,(£*  -  Ac,)  -  m*  ' 1  }e, 

+  £>•  (tr  \(k*+Ak,)TAjm*-t\A,(k*  +  Ak,)y,] 

+  tr  (3.40) 

If  Ac,  satisfies  (3.38)  which  is  a  modification  of  the  left-hand  side 
(3.25)  in  the  presence  of  Ac,,  there  exists  0  <  0  <  p,  <  1  such 
that 

e]{LT  -  Ac,)  TA  > * ; \A,(L  *  -  Ac,)c,  =  p/>,* '  '<?, ■  (341) 
Therefore, 

Ey,{  F,»  i(£,  + 1)—  P, (<?,)}  £  A,  -6,EY,{  ?,(<!,)}  (3.42) 

where  6,  =  1  -  />,,  and  0  <  K,  =  supy€Z+  {Ajy}  where 
A, ,  =  £>-, {tr  [(A*  +  Ak,)rATm*l\A,(k*  +  Ak,)y,]  +  tr  [m*; \Q,)}, 

(3.43) 

and  A',  is  bounded  from  above,  since  k*  is  bounded  from  above  by 
Assumptions  3.  4,  and  (3.14),  and  A k,  is  bounded  from  the 
hypothesis  of  the  theorem.  The  remainde*  of  the  proof  is  the  same 
as  that  of  Theorem  1 . 

Remark  3:  Note  that  the  perturbations  in  (3.38)  are  due  to 
variations  in  the  gain  calculation  and  not  in  the  system  nonlineari¬ 
ties  as  found  in  [  19],  [20].  Therefore,  although  condition  (3.38)  is 
similar  in  form  to  a  condition  given  [19,  Sect.  4.5.2],  the 
derivation  of  this  condition  based  on  a  time-varying  algorithm 
rather  than  a  constant  gain  algorithm  is  different.  Although  this 
global  sufficiency  condition  is  uncheckable  analytically,  it  can  be 
used  as  a  guide  for  engineering  evaluation.  For  example,  a  local 
test  may  be  constructed  for  a  given  initial  state  and  state  estimate. 
The  MGEKF  and  the  intermediate  MGEKF  algorithms  can  be  run 
for  the  same  ensemble  of  measurement  and  process  noise 
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sequences.  For  each  sequence  the  deviations  of  Ak,  and  Ac,  can  be 
calculated  and  the  boundedness  of  A  A:,  and  the  stability  of  the 
MGEKF  via  (3.38)  can  be  assessed.  This  procedure  was  used  in 
evaluating  the  performance  of  the  bearings-only  problem  de¬ 
scribed  in  Section  IV. 

Remark  4:  If  the  processes  are  ergodic.  boundedness  of  A,(k* 
+  A k,)  implies  that  the  exponential  stability  of  the  MGEKF  in  Z,2 
and  the  finite  gain  stability  of  the  MGEKF  in  Mir  are  equal  by  [12, 
Theorem  7). 

Remark  5:  A  time-varying  function  h,:Sl"  -»  Siq  is  called 
approximately  modifiable  if  there  exist  time-varying  matrices  of 
functions  g,:Slq  x  Si"  —  Slq‘"  and  £,:/("  x  Si"  -*  Stqx"  where  e 
=  x  -  x  so  that  for  any  x,  x  E  DC  Si"  and  i  E  Z„,  h,(x)  - 
h,(x)  =  [g,(z*  x)  +  £,(*,  e)]e  where  z*  =  h,(x)  and  where 
lim  ,,M0  || C,(ac,  £)||/||g,(;*  -v)II  0.  The  effect  of  the  error  £,(*, 
e)e  on  stability  is  to  contribute  to  the  deviations  Ac,  and  AAr,  in 
Theorem  2.  For  example,  Ac,  is  replaced  by  the  deviation  Ac,  A 
Ac,  +  (k*  +  Ak,)€,(x,  e)  in  (3.37)  and  also  (3.38).  The  gain 
algorithm  (3.7)  and  (3.9)  uses  the  modifiable  part  g,.  The 
bearings-only  measurements  are  shown  to  be  approximately 
modifiable  for  the  three-dimensional  problem  in  Section  IV  and 
exactly  modifiable  for  the  two-dimensional  case  [21]. 

IV.  Application  to  Bearings-Only  Measurement  Problem 
(BOMP) 

In  [29]  a  new  measurement  model  based  on  a  transformation  of 
the  original  measurements,  called  pseudomeasurements,  is  pro¬ 
posed  which  is  linear  in  the  states  of  the  system  with  a  coefficient 
matrix  composed  of  nonlinear  functions  of  the  original  measure¬ 
ments.  By  using  the  pseudomeasurements,  observability  criteria 
are  rigorously  established  for  the  BOMP  [18].  However,  as 
shown  in  [3],  the  resulting  estimates  of  a  linear  filter  structure  are 
biased.  The  simulation  results  here  demonstrate  this  as  well. 
Moreover,  in  the  Appendix,  the  bias  in  the  estimate  of  the  PMF  is 
analyzed.  By  a  different  approach  this  analysis  generalizes  [3]. 
Since  the  PMF  produces  biased  estimates  and  because  of  the 
nonlinearities  of  the  problem,  most  of  the  studies  have  been 
conducted  using  various  forms  of  the  EKF  for  the  two-dimen¬ 
sional  BOMP  (see  bibliography  in  [10]  and  [16]).  Recently,  [2] 
reported  successful  results  using  the  EKF  for  the  two-dimensional 
BOMP  formulated  in  a  modified  polar  coordinate  when  no 
process  noise  is  present.  However,  as  shown  by  [9]  through 
statistical  consideration,  and  by  several  others  through  simulation 
studies,  EKF  for  the  BOMP  formulated  in  a  rectangular  coordinate 
still  remains  a  problem.  Fortunately,  it  can  be  shown  that  the 
measurement  equation  from  the  two-dimensional  BOMP  formu¬ 
lated  in  a  rectangular  coordinate  is  both  transformable  to  a 
pseudomeasurement  form  and  a  member  of  the  class  of  modifiable 
functions  (see  (21.  Sect.  4.2]).  For  the  three-dimensional  BOMP 
which  is  more  realistic  for  homing  missile  engagement  problems, 
the  measurement  equations  are  approximately  modifiable  (see 
Remark  5  of  Section  III-B  for  the  relationship  with  the  previous 
analysis). 

A.  System  Dynamics  and  Pseudomeasurements  for  the 
Homing  Missile  Problem 

The  deterministic  system  dynamics  of  the  missile  intercept 
problem  written  in  rectangular  coordinates  are  linear 


measured  perfectly  from  the  accelerometers.  The  control  u,  is 
generated  from  the  homing  guidance  law  which  is  derived  by 
using  linear-quadratic  theory  [7]  and  is  the  basis  for  modern 
homing  missile  guidance.  The  dynamic  coefficients  for  (4.1)  are 


h. 

At/x, 

-i(e  ^'+\At-  I)/, 

A  “ 

-  (Ar/2)/> 

A,= 

0, 

|<l-e  ^')h 

A 

.  B,= 

-A//, 

0, 

0, 

e  “  *-*'/) 

0 

- 

_ 

_  _ 

(4.2) 

where  /„  is  an  n  x  n  identity  matrix.  At  is  the  time  interval 
between  measurements,  and  X  is  determined  from  the  bandwidth 
of  the  target  acceleration  assumed  as  a  band-limited  colored  noise 
process.  Note  that  A,  in  (4.2)  satisfies  Assumption  1  in  Section 

m-A. 

The  intercept  geometry,  measurement  angles,  and  relative 
range  are  given  in  Fig.  1.  The  azimuth  and  elevation  angle 
measurements  for  the  three-dimensional  BOMP  can  be  written  as 
(2.2)  where  z*  satisfies 


,  Y< 
tun  ~  _ 

QZ 

=  h,(x,)  = 

A', 

el 

tan  1  Zi/(X]  +  Y;)'  - 

By  using  simple  trigonometric  identities,  the  two  measurements  of 
(4.3)  are  manipulated  into  the  following  pseudomeasurements 

(yfizT)  —  I  A-  Al); 

[y,  1  _  I  sin  az,  -cos  az,  0,  •  •  -  0  j 

y2\  ~  |_  sin  el  cos  az,  sin  el  sin  az,  cos  el,  0  ■  -  •  0  J  X' 

kH,(z‘)x,  =  [0.  OlT  (4.4) 

For  the  stochastic  version  of  the  system,  the  dynamics  (4.1)  of 
the  BOMP  formulated  in  rectangular  coordinates  are  corrupted  by 
additive  process  noise  w,  where  u>,  is  zero  mean  white  noise  with 
second  moment  Q,  =  .  Since  the  target  model  is 

assumed  as  a  continuous  first-order  Gauss-Markov  process  along 
each  axis,  Q,  is  obtained  as  the  resulting  discrete  process  noise 
variance  where  Q*  is  the  assumed  power  spectral  density  for  the 
continuous  input  process.  Q,  is  written  as 

Q,=  \*'  <t>(At  -  t)CQ*Ct4>  T(At  -  t)  dr  (4.5) 

Jo 

where  <t>(At  -  r)  is  A,  with  At  -  r  replacing  A(.  and  C  = 
[0:0:/j]r.  Note  that  Q,  of  (4.5)  satisfies  Assumption  3  of  Section 
IIJ-A.  Similarly,  in  the  noisy  environment,  zero  mean  white  noise 
process  v,  =  [u^,  u2i]  T  is  added  to  h,(x,)  in  (4.3)  as  (3.2)  to  form 
the  noisy  measurement  equations  where  u,_  and  are  azimuth  and 
elevation  angle  measurement  noise,  respectively,  and  £{u,ur}  = 
7-  e  Sl2*2.~ 


% 


-r,*,  =A,x,  +  B,u, 


(4.1) 


where  the  state  vector  x  is  a  nine-state  vector  composed  of  three 
relative  positions  xTR  A  [A",  Y,  Z\,  three  relative  velocities  VrR  A 
[vjr,  uy,  vz],  and  three  target  accelerations  az  A  [ax,  aY,  az],  and 
where  uT  A  (aWf,  a^Y,  as1z\  is  the  three-dimensional  missile 
acceleration  used  here  as  the  control  vector  since  it  is  assumed  that 
the  autopilot  of  the  missile  has  zero-lag.  Note  that  in  implement¬ 
ing  the  estimator,  the  missile  acceleration  is  assumed  to  be 


The  pseudomeasurement  observer  [21]  can  be  extended  to  the 
noisy  environment.  In  that  case,  the  pseudomcasurements  |i.e. . 
(4.4)]  are  corrupted  by  the  state-dependent  noises.  These  state- 
dependent  noises  together  with  the  gain  structure  of  the  pseudo¬ 
measurement  filter  (PMF)  [I],  [21]  cause  the  biases  in  the 
estimates.  In  the  Appendix,  the  biases  in  the  estimate  of  the  PMF 
are  analyzed  by  using  the  innovation  processes  for  the  case  where 
the  process  noise  is  included,  whereas  in  [3],  the  bias  analysis  for 
the  no  process  noise  case  is  studied  by  a  batch  estimation  scheme. 
Since  the  noise  variance  R,  corresponding  to  the  pscudomeasure- 
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ment,  as  shown  in  the  Appendix,  is  associated  with  the  state- 
dependent  noise  (A. 2)  and  7,  is  the  noise  variance  of  the  original 
angle  measurement,  7,  and  R,  are  related  by 

y,  =  Dr'RlDrT  (4.6) 


where  the  range  matrix  D,  is 

V*f+Tj  0 

E ,  =  - 

0  'fx]  +  yj  +  zf 


(4.7) 


note  that  sin  az  =  Y/d,r,  cos  az  -  X,/d,it  sin  el  =  -Z,/djt.  and 
cos  el  =  d\/dir  Therefore.  Dtl  =  D/z,*,  x<)  and  -  D2l(z ,*, 
X,).  Hence,  a,  and  d,  in  (4.9)  satisfy 


a,  D\,. 

0 

sin  azX,  -  cos 

azY, 

0,  J  "  |_  0, 

d2i 

sin  e/0|(  +  cos 

elZ, 

If  we  add  and  then  subtract  P2l  sin  el/D )(  to  0,  of  (4.10),  then 

A  =  f?:,(sin  el  cos  azX,  +  sin  e!  sin  az  Y, 

+  cos  e/Z,  -1- sin  e!  (<?,,- I /Z)1()).  (4.11) 

The  last  term  in  (4.11),  which  we  denote  by  5,(.tr,,  et),  can  be 
bounded  as 


5,(x„  e, )///(•»',.  -v() 

=  -  (XX  +  Y,  >',)  <  (X,  -  X,) 2 

+  (Yl-?,)2£gfe,  (4.12) 


where  f,(x r, ,  x,)  =  tan  el/(d +  Z,Z,).  This  bound  is  used  to 
show  that  the  elevation  angle  measurement  is  approximately 
modifiable.  By  using  H(z*)  given  in  (4.4),  (4.10)  becomes 


[a]  "[*]  + [«*•«>] 

-[oev  ^ 4I] 


(4.13) 


By  using  (a,,  0,]  rof  (4.13)  and  (4.4),  the  measurement  residual  in 
(4.9)  can  be  written  as 


h,(x,)  -  h,(X)  =  -E,H,(z?)(x,-X,)  +  E,{ 0,  &,(x„  e,)/p2l]T 


where 


(4.14) 


EX 


(£>,,  tan  1 

0. 


a,)/a,. 


0 

(p2|  tan  - 1 


0,)/0, 


(4.15) 


Since  the  actual  state  x,  is  not  available,  7,  is  approximated  as 

y(s  0,-'R,DrT  (4.8) 

where  Z5,  is  D,  calculated  using  the  state  estimates  x,  of  the 
estimation  algorithm. 

B.  Modifiability  of  h,(x,) 

In  this  section  it  is  shown  that  the  measurement  equation  of  the 
'two-dimensional  BOMP  is  a  member  of  the  class  of  modifiable 
functions,  and  h,( x,)  in  (4.3)  for  the  three-dimensional  BOMP  is 
approximately  modifiable.  The  measurement  residual  in  (2.5)  is 
now  manipulated  for  the  BOMP  as 


where  d,,  =  fX2  +  Y],  Ut  Dh  k  d,/(XX,  +  Y,  ?,)  and  D2i  k 
d2/{d,dt'  +  Z,Z,)  where  d2,  =  •JX]  +  Yf  +  Zf.  Furthermore, 


If  (4.13)  is  used  explicitly  in  (4.15),  then  (4.14)  takes  on  the 
approximately  modifiable  form  as  defined  in  Remark  5  where  the 
modifiable  pan  is  g,(z*,  *,)d,  =  -£,(z*,  x,)H,(z?)e,  where  £,  is 
E,  evaluated  with  0,  rather  0,,  and  the  remainder  term  becomes 
-{(£,  -  £,)H,(z*)e,  -  £,[0,  5,(x,,  e,)/Dh\T}.  Note  that  the 
nonzero  element  of  E,  -  £,  can  be  written  as  an  explicit  function 
of  5,  as  (0,  tan"1  (5/(1  +  0,0,))  -  6,  tan"1  0,)/Pj,/0,0,.  This 
remainder  has  the  property  ascribed  to  the  term  S,(jr,,  i,)e,  in 
Remark  5  in  an  appropriate  region  D  since  as  ||£,||  -*  0  this 
remainder  is  proportional  to  5,(jr,,  i,)  which  has  the  quadratic 
error  bound  given  in  (4.12).  By  inspection  of  f,(x,,  x,),  the  region 
2D  appears  to  be  all  of.r,  x  €  Jl9  except  for  1/(1/0|(  +  l/d2/  + 
l/|cos  (el  -  el) |)  <  (  where  t  is  some  small  positive  number, 
and  d2f  and  el  are  the  values  of  d2i  and  el  using  the  state  estimates. 

For  the  noisy  environment  g,(z,,  x ,)  is  used  to  calculate  the  gain 
of  the  MGEKF.  where  z,  in  this  case  is  of  the  form  given  in  (3.2). 
Although  the  measurement  el  is  approximately  modifiable  through 
0,  of  (4.13),  the  measurement  az  belongs  to  the  class  of  modifiable 
functions  [21). 

C.  Simulation  Results  and  Comparisons 

The  results  given  in  [21]  are  for  the  two-dimensional  BOMP 
and  noiseless  environment.  The  simulation  study  given  here 
considers  the  three-dimensional  BOMP  where  both  the  noisy  and 
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noiseless  environments  are  included.  The  launch  scenario  is  given 

by 

*0  =  3500  ft,  T0=1500  ft.  Zo=10O0  ft 

vXo  =  -  1 100  ft/s,  o>-0=  -  150  ft/s,  uzg=  -50  ft/s 

arx~  10  ft/s arY-  10  ft/s2,  aTz  =  10  ft/s2 

and  the  initial  estimates  x0  of  the  state  ,v0  are  assumed  to  be 

*o=  3000  ft.  Y0  =  1 200  ft.  Z0  =  800  ft 

&a0=  -950  ft/s,  'uYg  =  -  100  ft/s.  0Zo=  -  100  ft/s 

aT r  =0  ft/s2,  aTy  =0  ft/s2,  aT7  =0  ft/s2. 

'o  'o  zo 

The  filters  (or  observers)  are  initialized  with  a  diagonal  P0  matrix 
where 


The  noise  variance  corresponding  to  the  pseudomeasurement  R, 

(see  the  Appendix)  is  assumed  constant  such  that  R,  =  0.1  /2  and 
the  variance  of  the  process  noise  Q,  in  (4.5)  is  found  using  the 
power  spectral  density  Q*  -  0.1  /j.  For  the  system  dynamics, 

X  =  1  is  used,  and  the  sampling  rate  used  in  the  simulation  is  50 
Hz. 

The  value  of  weighting  between  control  effort  and  terminal 
miss  in  the  quadratic  cost  criterion  to  generate  the  guidance 
command  u,  (7)  is  10  4.  Clearly,  in  practice  the  control  law  is 
mechanized  by  using  the  estimated  value  of  the  states  rather  than 
the  states  themselves  as  implemented  in  our  simulation.  However, 
since  the  emphasis  here  is  on  filter  performance,  the  guidance  law 
is  only  used  to  establish  trajectories  from  which  the  observers  and 
filters  are  tested  and  compared. 

The  performance  of  the  observers  (PMO,  EKO,  MGEKO)  and 
filters  (PMF,  EKF,  MGEKF)  is  measured  here  by  using  the 
histories  of  errors  in  the  estimates.  By  comparing  the  error 
histories  of  the  observers  to  those  of  the  filters,  the  biases,  so 
prevalent  in  the  PMF,  can  be  demonstrated  to  be  effectively 
reduced  in  the  MGEKF. 

As  shown  in  Figs.  2-4  for  the  deterministic  system,  the 
responses  of  the  PMO  and  the  MGEKO  are  quite  similar,  y 
although  the  PMO  performs  a  bit  slower  than  the  MGEKO.  If  the  $ 

initial  errors  were  reversed  in  sign,  then  the  PMO  performs  a  bit  p 

faster  than  the  MGEKO.  Note  that  the  responses  of  the  MGEKO  — 
in  this  case  are  obtained  by  using  the  same  algorithm  as  the  . 
MGEKF.  Therefore,  the  gain  k,  is  a  function  of  the  past  E 
measurements.  This  is  done  to  compare  the  performance  of  the  5 
observer  and  filter  with  the  same  structure  but  used  in  different  B 
noise  environments.  Clearly,  the  EKO  performs  poorly  for  these  z 
relatively  large  initial  errors.  g 

For  the  stochastic  environment,  the  results  of  100  runs  of  g 
Monte  Carlo  simulation  are  presented  in  Figs.  5-7.  The  error  in 
the  range  estimate  at  a  specific  time  /  in  Fig.  5,  for  example,  is 
plotted  by  using  the  value  7 E{ex ,} 2  +  E{ery  +  £’{erJ 2  where 
E{eXl}  is  the  averaged  value  of  the  error  in  the  estimate  of  the  *, 
over  100  runs  of  Monte  Carlo  simulation.  Similar  rms-type 
quantities  are  plotted  in  the  other  figures.  Figs.  5-7  show  that  the 
estimates  of  the  PMF  are  biased  away  from  the  deterministic 
responses  of  Figs.  2-4,  While  the  MGEKF  shows  good  tracking 
performance,  the  EKF  remains  poor.  When  the  initial  errors  are 
small,  the  three  observers  for  the  deterministic  system  perform 
equally.  However,  for  the  noisy  environment,  the  EKF  and  the 
MGEKF  perform  similarly,  while  the  PMF  still  shows  biases 
which  are  quite  affected  by  P0  as  shown  in  (A.  17).  This  is 
particularly  true  for  the  short-time  engagement  problem  of  the 
homing  missile.  Therefore,  at  least  for  this  scenario,  very 


TIME  (SEC) 

Fig.  2.  Errors  in  range  estimates  of  the  PMO,  EKO.  and  MGEKO  for 
noiseless  environment. 


Fig.  3.  Errors  in  velocity  estimates  of  the  PMO.  EKO,  and  MGEKO  for 
noiseless  environment. 
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accurate  estimates  of  the  initial  'latCN  are  critical  for  the  PMF  and 
the  EKF,  while  the  MGEKF  performs  well  under  all  conditions 
To  determine  the  deviations  and  Ac  startme  irom  the  given 
initial  state  and  state  estimate,  the  intermediate  MGEKF  was  run 
using  the  same  ensemble  ot  measurement  and  princess  noise 
sequences  as  used  in  evaluating  me  Mt  ,1  k  t  her  the  ensemble 
ot  realizations  of  these  deviations  die  b  mndedness  of  A/f 
required  by  Theorem  2  was  never  even  Jose  to  heine  violated  and 
the  stochastic  stability  ot  the  MGFkF  nv  condition  <3  'Si  was 
always  satisfied. 


V.  Cnsct.i  sioss 


Appendix 


The  objective  of  this  Appendix  is  to  generalize  the  results  of  [3] 
and  to  show  the  structure  of  the  PMF  used  in  the  simulation  and  its 
inherent  biased  characteristics  in  a  stochastic  environment. 

The  pseudomeasurement  _v,(z*)  in  (4.4)  is  changed  to  the 
following  form  in  the  noisy  environment: 


y,(z,)  =  =Hl(z,)xl  +  nl(x,,  v,) 


(A.  1) 


where  the  noise  corrupted  z,  is  from  (3.2),  and  the  state  dependent 
noise  n,  is 


Mx„  u,)=-l(X~+  (Xf+Yf  +  Z})1'2 v2i  r  (A. 2) 


Uq  and  u2|  are  the  original  angle  measurement  noises  defined  in 
Section  1V-A. 

The  algorithm  of  the  pseudomeasurement  filter  (PMF)  (l), 
[21],  similar  to  the  Kalman  filter,  satisfies 


X, ,  i  =  A,x,  +  Blu, 
x,  =  x,  +  K,(y,(z,)  -  H,(z,)Xi) 

P,  =  U-KlHl(z,))Ml 
K,  =  M,fi(Z,)T(H,(z,W,H,(z,)  T+R,) 
M,4.t  =  A,P,A*J>-Q,. 


(A. 3) 
(A. 4) 
(A. 5) 
(A  6) 
(A. 7) 


Therefore,  the  error  in  the  estimates  e,  of  the  PMF  can  be  written, 
by  using  (4.6).  (A. 3).  and  (A. 4).  as 


e,  =  v,. .it’ll 


(A. 8) 


where 


*„=n  u~KkHk)A =n 

t  =/  4  -J 


(A. 91 


A  new  observer,  called  the  MGEKO  for  a  class  of  nonlinear 
functions  called  modifiable  functions,  is  designed  such  that  it  is 
globally  stable.  A  stability  analysis  ot  the  MGEKF  is  performed 
by  introducing  as  an  intermediate  step  a  nominal  filter  called  the 
intermediate  MGEKF.  In  Theorem  1 ,  the  intermediate  MGEKF  is 
shown  to  be  globally  stable  in  the  probabilistic  Hilbert  space  L:. 
Sufficient  conditions  for  the  MGEKF  to  be  globally  stable  are 
found  in  a  similar  way  to  that  of  the  intermediate  MGEKF.  but  a 
condition  on  the  allowable  deviations  to  be  nondestabilizing  is 
required  in  Theorem  2. 

The  MGEKO  and  the  MGEKF  are  applied  to  the  three- 
dimensional  BOMP  formulated  in  rectangular  coordinates  which 
by  Remark  5  is  approximately  modifiable.  For  the  three- 
dimensional  deterministic  formulation  of  the  BOMP.  the  MGEKO 
performs  in  the  simulation  almost  identically  to  the  PMO. 
However,  the  EKO  can  only  estimate  the  states  when  the  initial 
errors  in  the  estimates  are  small.  For  the  stochastic  formulation, 
these  observer  structures  are  retained.  While  the  estimates  of  the 
PMF  are  biased,  the  EKF  and  the  MGEKF  show  seemingly 
unbiased  characteristics  in  the  simulation.  However,  the  EKF 
appears  only  stable  in  the  small.  Although  the  simulation  results 
indicate  that  the  MGEKF  is  stochastically  stable,  checking  the 
conditions  of  Theorem  2,  with  respect  to  an  ensemble  of 
measurement  and  process  noise  sequences  starting  from  a  given 
initial  state  and  state  estimate,  produces  additional  evidence  of 
stochastic  stability. 


Since  A,  in  (4.2)  is  a  constant  matrix,  the  subscript  /  is  suppressed. 
In  order  to  understand  the  biased  behav  ior  of  the  PMF  estimates,  a 
closed-form  expression  for  the  error  e,  is  to  be  determined. 
Introduce  the  transformation  r.  as 


31,  =  Pit,., 

_  V,  J  ~r'[  V,_,  _ 


A-t+H[R,  'H,QA  r  H[R,  'H,A 
QA  '  ,-l 


*31.-1 

V, 

(A. 10) 


where  31,  £  ;r*\  V,  £  Let  31,,  =  P,,1,  and  V„  =  /. 

Then, 


31,]  _  fain 

V,  J  T‘  |T°[v0 


J  *„.</.  0)  vM»,  °)  ]  [p,.  1  " 

-  _,(/,  0)  I £,,  ,(/,  0)  J  [  / 


(A. 11) 


Since  the  solution  to  the  discrete  Riccati  equation  is 
P,  =  [^2|(/,  0)Po  1  + i^,2  ,(/,  0)](^M  ,(/.  0)P0 ■' +  *„_,(/,  0)]-' 

(A. 12) 


then  P,  =  V,3l,  '.  L,A  in  (A. 9)  can  be  written  in  terms  of  31,, 
after  some  manipulations,  as 


L, A  =  31  -  r3l  T_ 


Now,  e,  due  to  n/s  can  be  written,  by  using  (A.  1 ),  (A. 8),  (A. 9). 
and  (A.  13),  as 


_  py~  T  ^  VjHjR~  'Hjxj  . 
j- 1 


(*,->„  =  'Hr\  +  LAP,  _  XH]_  XR;\H, .  ,.x  ,  +  •  •  ■ 

(A. 14) 

Similarly,  the  e,  due  to  w,  and  the  e,  due  to  ea  can  also  be  written  as 

i-i 

(e,L=Py,-TZ  31/8  'U;  (A.  15) 

J’0 

and 


(e)e0  =  P,V,-rp o'v 


(A  16) 


Therefore, 

e,  =  P,V~  rl  p0  <en  +  £  <u[A  ~  +  V  VjHjR,  '.v 


J  " 


}  '  I 


(A. 17) 
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Because  of  the  last  term  of  (A.  17)  which  is  the  error  due  to  the 
state  dependent  noise  p.  E[et\  is  nonzero. 
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The  Modified  Gain  Extended  Kalman  Filter  and 
Parameter  Identification  in  Linear  Systems* 

TAEK  L.  SONG+  and  JASON  L.  SPEVERt 

.4  new  nonlinear  filter,  developed  for  a  special  class  of  nontineariiies.  has  a  universal 
linearization  with  respect  to  the  measurement  Inactions  and  superior  convergence  ami 
stability  characteristics  when  applied  to  slate  and  parameter  estimation  problems  in 
linearized  systems. 

Kt>  Words  Estimation:  littering:  l extended  Kalman  titters i:  nonlinear  tillering,  observers:  parameter 


estimation:  slate  estimation. 

Abstract  For  a  special  class  of  systems,  a  general  formulation 
and  stochastic  stability  analysis  of  a  new  nonlinear  filter,  called 
the  moditied  gain  extended  Kalman  tiller  iMGEKFl.  is 
presented  Used  as  an  observer,  it  is  globally  exponentially 
convergent  In  the  presence  of  uncertainties  a  nominal 
nonreali/able  tiller  algorithm  is  developed  for  which  global 
stochastic  stability  is  proven  With  respect  to  this  nominal  tiller 
algorithm,  conditions  are  obtained  such  that  the  effective 
deviations  of  the  realizable  tiller  are  not  destabilizing.  In  an 
appropriate  coordinate  frame,  the  parameter  identification 
problem  of  a  linear  system  is  shown  to  he  a  member  of this  special 
class  For  the  example  problems,  the  MGFKF  shows  superior 
convergence  characteristics  without  evidence  of  instability 


I  INTRODUCTION 

For  a  special  class  of  systems,  a  general 
formulation  and  stochastic  analysis  of  a  new 
nonlinear  filter,  called  the  modified  gain  extended 
Kalman  filter  iMGEKF).  is  presented.  The  essential 
idea  behind  the  MGEKF  is  that  the  nonlinearities 
be  "modifiable  nonlinearitics"  implying  a  type  of 
universal  linearization.  This  simple  notion,  defined 
in  Section  2.  is  the  central  idea  used  in  developing 
the  structure  of  the  nonlinear  estimator.  The 
analysis  for  the  MGEKF  was  first  given  in  Song  and 
Speyer  ( 1985)  for  the  case  of  linear  dynamics  and 
modifiable  nonlinear  measurement  function.  Elere. 
the  general  MGEKF  is  presented  for  a  stochastic 
system  where  both  the  nonlinear  measurement 
functions  and  nonlinear  dynamics  arc  in  the  class  of 
modifiable  functions.  The  stability  analysis  pre¬ 
sented  here  is  sufficiently  different  from  that  of  Song 
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and  Speyer  ( 1 9X5 1  lo  warrant  presentation.  This 
generalization  is  partly  motivated  by  the  parameter 
identification  problem  in  linear  systems.  It  is  shown 
that  in  an  appropriate  coordinate  frame,  the 
nonlinear  dy  namics,  associated  with  an  augmented 
state  vector  which  includes  the  unknown  para¬ 
meters.  is  a  modifiable  function. 

Recursive  identification  of  system  parameters  has 
been  widely  studied  in  recent  years.  Among  them  are 
the  identification  procedures  developed  by  Landau 
(1976)  where  his  model  reference  adaptive  systems 
(MR  AS)  technique  is  analyzed  solely  on  the  bases  of 
deterministic  stability  considerations  by  applying 
hyper-stability  concepts  to  the  equivalent  feedback 
representation  ( EFR )  of  the  algorithm.  Ljung  (1977) 
has  developed  a  useful  method,  called  the  ordinary 
differential  equation  (ODE)  method,  to  analyze  the 
convergence  of  the  recursive  identification  algor¬ 
ithms  in  the  presence  of  uncertainty.  Ljung  (1979) 
has  studied  the  convergence  analysis  of  the 
predicted  state  extended  Kalman  filter  (EKF) 
applied  to  the  simultaneous  identification  of  the 
states  and  parameters  of  linear  systems  by  using  the 
theory  developed  in  Ljung  ( 1 977 ).  Ljung  ( 1979)  also 
suggested  a  modification  to  the  predicted  state  EKF 
algorithm  to  assure  its  asymptotic  conv  ergence.  The 
analysis  in  Ljung  ( 1979)  requires  that  the  stability  of 
a  nonlinear  system  he  tested  so  as  to  insure  that  the 
estimates  of  the  parameters  are  stable  and 
convergent  to  stationary  values.  These  stationary 
values  are  obtained  by  keeping  the  current  estimates 
of  the  parameters  inside  the  stability  domain  defined 
such  that  the  estimates  of  the  states  are  exponen¬ 
tially  stable.  Weiss  and  Moore  (19X11)  have 
developed  an  exponentially  convergent  estimation 
algorithm  which  does  not  require  a  stability  test  by 
incorporating  the  Kalman  gain  calculations  inside 
the  system  matrix  of  the  state  estimates.  The 
convenience  characteristics  of  the  estimates  are 
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asv  mptoticull)  equivalent  to  the  modified  algorithm 
of  Ljung  ( 1979 1.  However,  the  modification  in  Ljung 
(1979i  and  Kalman  gam  calculations  in  Weiss  and 
Moore  1 1980 1  certainly  require  more  computational 
effort.  Moreover,  in  those  papers,  the  cause  of 
instability  of  the  algorithm  is  rather  overlooked 
while  concentrating  on  obtaining  stable  estimates. 

In  this  paper  the  MGEKF  is  applied  to  the 
parameter  identification  of  linear  systems.  In 
Section  2.  the  globally  convergent  observer,  called 
the  modified  gain  extended  Kalman  observer 
(MGEKO).  is  developed  for  a  special  system 
composed  of  both  nonlinear  dynamics  and  non¬ 
linear  measurements  in  an  effort  to  generalize  the 
result  of  Song  and  Speyer  (1985).  The  gain 
algorithm  of  the  MGEKO  is  the  same  as  that  of  the 
Kalman  filter.  By  using  Lyapunov's  second  method, 
the  resulting  algorithm  is  shown  to  be  exponentially 
convergent.  Another  exponentially  convergent 
algorithm  for  the  parameter  identification  problem 
can  be  found  in  Anderson  and  Johnson  (1982).  In 
Section  3.  a  stability  analysis  of  the  MGEKF  is 
studied  in  the  probabilistic  Hilbert  space  L1  by 
introducing  an  exponentially  bounded  nominal 
filter  called  the  intermediate  MGEKF.  Later, 
sufficient  conditions  for  the  MGEKF  to  be 
exponentially  stable  in  L2  are  obtained  by 
comparing  the  estimates  of  the  MGEKF  with  those 
of  the  intermediate  MGEKF.  Besides  the  general¬ 
ization  of  the  results  of  Song  and  Speyer  (1985).  a 
stability  analysis  for  the  parameter  identification 
problem  is  studied.  In  Section  5  two  examples  of  the 
application  of  the  MGEKF  to  the  parameter 
identification  problem  are  presented.  A  simple 
example  of  Ljung  (1979)  illustrates  that  a  modi¬ 
fication  in  Ljung  (1979)  is  similar  to  the  MGEKF 
algorithm,  at  least  for  the  transient  period  which  is 
critical  to  stability  of  the  estimates.  Furthermore, 
the  filtered  state  EKF  may  have  global  stability  for 
this  example.  A  convergence  analysis  of  Ljung 
(1979)  is  studied  for  the  MGEKF  using  a  simple 
example  and  compared  with  the  filtered  state  EKF 
analyzed  in  Westlund  and  Tysso  (1980)  and  Ursin 
(1980).  However,  when  the  process  noise  variance  is 
not  known  exactly,  the  MGEKF  has  slightly  bigger 
biases  than  the  filtered  state  EKF.  Finally,  the 
MGEKF  is  applied  to  a  pole  identification  problem 
of  a  linear  time-invariant  system  excerpted  from 
Saridis  1 1974)  where  the  EK  F  is  shown  to  have  poor 
performance.  This  application  also  illustrates  the 
robustness  of  the  MGEKF  since  statistics  of  the 
process  noise  model  are  mismatched  with  that  of  the 
actual  noise.  Note  that  since  it  is  usual  to  model  the 
parameters  as  constants,  the  parameter  estimator 
gains  for  large  time  are  inversely  proportional  to 
time,  and  therefore,  converge  to  zero.  However,  our 
results  extend  to  parameter  models  which  for 
example  are  Gauss  Markov  so  that  the  parameter 


estimation  gains  remain  finite. 

2.  THE  MODIFIED  GAIN  EXTENDED  KALMAN 
OBSERVER  iMGEROi 

In  this  section,  a  globally  convergent  observer 
called  the  modified  gain  extended  Kalman  observer 
(MGEKO)  is  developed  for  a  class  of  nonlinear 
functions  (Song  and  Speyer.  1985).  Song  and  Speyer 
(1985)  have  developed  the  MGEKO  for  systems 
with  linear  dynamics  and  nonlinear  measurement. 
This  is  generalized  here  to  the  case  where  both  the 
system  dynamics  and  measurements  are  nonlinear 
functions  of  the  system  states. 

Consider  the  deterministic  discrete-time  non¬ 
linear  system  governed  by  the  following  equations 

v,. ,  =  /.(.x,l  (2.1) 

a,)  (2.2! 

where  leZ.  (the  non-negative  integers).  ,v,e  H"  are 

state  variables,  and  r*s?A  are  noiseless  measure¬ 
ments. 

The  notion  of  a  modifiable  nonlinearity  is  defined 
as 

Definition  l.A  time-varying  function  a, :  ?/  is  a 

modifiable  nonlinear  system  Junction  if  there  exists  a 
p  x  n  time-varying  matrix  of  functions  //, :  PA  x  R" 
—  ?.p " "  so  that  for  any  x.  .x  e  and  / e  Z  „ . 

u,(.x)  -  u,(.x)  =  //,(-*. .\)<v  -  v)  (2.3) 


where  z*  =  /i,(.v). 

Note  that  V/( cf. .v, ) ( x,  -  x,)  in  (2.3)  is  a  universal 
linearization  with  respect  to  the  measurement 
function  /i,(.x,j  without  any  approximation.  Notice 
also  that  '//,(-*.  .x)  =  V/,(/i,(.y). .x)  *  y/,(/i,(x). ,v), 
where  this  latter  quantity  is  (ifu,  is  differentiable)  the 
differential  of  a,  evaluated  at  .x,  as  used  in  linear¬ 
ization. 

Although  the  class  of  modifiable  functions  is 
small,  it  contains  nonlinear  functions  used  in  many 
practical  estimation  problems.  Two  examples  are 
given  below  which  illustrate  a  modifiable  non¬ 
linearity  in  the  dynamics  and  in  the  measurement 
function. 

Example  1.  Consider  the  simple  linear  dynamic 


system 

with 

an  unknown 

coefficient 

which  is 

analyzed  for  the  noisy  case 

in  Section  5 

y,  ■  i 

=  0,y„  (I, .  , 

=  0, 

v.  (2.4) 

where 

■v,r  * 

[  H,  ].  This 

is  easily 

put  into 

modifiable  form  as 


ff.L 

O.y,  -  o.y,  +  fl.y,  -  0,  v, 

.  ",  _ 

. 

0,  -  n, 

=  //,(-.*.  v.llv,  -  v, )  (2.5) 
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where 


Note  z*  =  v,  is  used  in  (2.6 1.  Note  that  since  no 
differentiation  is  assumed,  parameters  modelled  as 
discrete  valued  random  variables  can  be  included 
(Marcus  and  Westwood.  1 9S4 ). 

Example  2.  For  the  noiseless  two-dimensional 
bearings-only  measurement  problem  (Nardon  and 
Aidala.  1981 ).  the  system  equations  are  governed  by 

.X,.  ,  =  .4.x,  +  Bit,  (2.7) 

=  tan  ‘(V,  A',)  )  (2.8) 

where  -1  and  B  are  known  constant  matrices,  it,  is  a 
known  guidance  command,  and  v,  is  the  system 
state  consisting  of  two  relative  positions  ( A',.  Y, ).  two 
relative  velocities  ( v v,.  v>. ).  and  two  target  accel¬ 
erations  («Tvi,  uTr,)  such  that  =  f.Y.  V.  vv.  \Y.  aTy. 
aTr]! ■  Then  the  nonlinear  measurement  function 
/1,1.x, ) of  (2.8 lean  be  manipulated  into  the  form  /i,(.x,) 
-  (/i,(.x,)  =  V/,(r*, ,x, )(.x,  -  ,v, )  where  (Speyer  and 
Song,  1981 ) 

'//j(-i*. .V, )  =  D,(tan  1  a, )  x,Hiz*)  (2.9) 

where  H(z?)  =  [sin z*.  -cosr*.  0,  0.  0.  0]. 
D,  =  1  (cos;,* A',  +  sin-*y,).  and  x,  =  D,Hiz*)x,. 
The  three-dimensional  bearings-only  measurement 
problem  is  shown  to  be  an  approximately 
modifiable  function  in  Song  and  Speyer  (1985). 
Other  examples  of  modifiable  functions  are  found  in 
Song  ( 1983). 

If  the  observer  for  the  system  of  (2.1 )  and  (2.2)  is 
selected  in  the  form  of  the  extended  Kalman  filter 
(EKF),  then 

•*/♦,  =/,(.x,)  (2.10) 

■X,  =  .x, +k,(r,* -/i,(.x,))  (2.11) 

where  ,x,  is  interpreted  as  a  propagated  estimate  at 
time  /.  x,  is  interpreted  as  an  updated  estimate,  and 
ki  is  some  gain  sequence  that  may  depend  upon  past 
and  present  data.  Suppose  /  (■)  and  hi  )  of  (2.1 )  and 
(2.2)  are  modifiable  functions  such  that 

/.( V, )  -  /,(.x,|  =  c/,|-*..x,)(.x,  -  x,|  (2.12) 

and 


(2.101  and  (2.1  1 )  can  be  written  by  using  (2.1  1. 12  2|. 
(2.10)  and  (2.1  1 )  as 

c, .  i  =  v,.  ,  -  X,.  ,  =  '/,(;*.  x, It1,.  (2.14) 

and 

f,  =  -x,  -  x,  =  (/  -  k,  |r/,(r*.  x, lie,  -  I.,,-,.  (215) 

Since  (2.14)  and  (2.15)  are  written  without  am 
approximations  and  they  are  in  the  same  form  as 
that  of  a  linear  estimation  system,  a  particular  gain 
sequence  based  upon  Kalman  filter-type  updates  is 
chosen.  Specifically. 

k,  =  ill, '/,(-*.  x,  )'('>,(-*.  x,)/n,  '/,(-*.  v,l'  +  y,  l  1 

(2.16) 

/>,  =  (/  -  k, />,( v, ) |(ii,t /  -  k,  </,(r1*.  v, i)'  +  k., ,k/ 

(2.17) 

in,.,  =  x,  v,)'  |2.18| 

Note  that  if  we  had  the  linear  estimation  problem 
with  dynamics 

4,-i  =  '/,lr*..v,  )2,  +  v>,  (2.19) 

and  measurements 

.Vi  =  I-,*..',  >4,  +  v,  (2.20) 

where  the  white  noise  sequences  <•>,  and  v,  have 
covariances  Q,  and  respectively,  then 

(2.16)  (2.18)  would  be  precise  equations  for  the 
covariances  m,  and  p,  of  the  one-step  predicted  and 
filtered  estimates  of  c„  respectively  .  Here  we  are 
assuming  that  z*  and  x,  are  known.  We  will  view  Q , 
and  ;,  as  design  parameters  and  will  call  in,  and  p, 
the  “pseudocovariances"  of  e,  and  e„  respectively. 

If  (  '/,. '/>,)  is  uniformly  observable  and  |//„Q,1  :| 
is  uniformly  controllable,  it  can  be  shown  that  the 
error  dynamics  of  (2  1 4 1  and  (2.15)  are  globally 
convergent  to  zero  by  using  the  Lyapunov  function 
file,)  =  i'! p,  V,  in  a  way  that  is  similar  to  the 
approach  used  by  Moore  and  Anderson  1 1980)  and 
McCarty  (1974).  Note  that  the  uniform  observ¬ 
ability  and  uniform  controllability  involve  a  rank 
test  ol  the  corresponding  Cramians.  Unfortunately. 

o',  \  and  ('/,.  (7,i:)  for  modifiable  nonlinear 
systems  are  realization  dependent  such  that  the  tests 
are  not  a  priori  checkable. 

Remark  I.  In  the  next  section  white  measurement 


/i,(.x,)  -  /i,(  v.)  =  x,)(.x,  -  x,)  (2.13) 


where  '/,(r*.  x,)e  ^,iz*.  x,)e  Then,  the 
actual  errors  in  the  estimates  of  the  observer  of 


and  process  noise  sequences  are  added  to  the 
measurement  (2.2)  and  dynamics  (2.1 1.  If  the  gam 
algorithm  of  (2.16)  is  used  in  this  stochastic 
environment,  biased  estimates  are  expected,  since 
the  gain  and  (lie  residual  of  (2.11)  are  d  i  reel  I  v 
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correlated  (see  Sony  and  Speyer.  |9,X5.  for 
additional  detail  I.  Therefore,  a  gain  algorithm 
similar  to  that  of  the  EKF  which  ensures  that  the 
gain  is  a  function  of  the  past  measurements  onl>  is 
recommended.  However.  if  the  measurement  equa¬ 
tion  is  a  nonlinear  function  of  the  states,  the  useful 
relationship  between  the  observability  Grunian 
and  the  p,  matrix  in  (2.17)  is  no  longer  available. 
This  discussion  motiv  ales  the  assumptions  required 
in  the  development  of  the  modified  gain  extended 
Kalman  filter  i.MGEKE)  presented  in  Section  .V 

3.  THt  MOnitlED  (IMS  I  ATI  \l)l  I)  KALM  \N 
HI  T)  R  iXKit  Kl  i 

In  this  section  we  develop  the  MGEKF  and  study 
its  stochastic  stability.  As  stated  m  Remark  1.  the 
gain  algorithm  of  the  MGEKF  is  altered  from  that 
of  the  MGEKO  in  order  to  reduce  the  biases  due  to 
direct  correlations  between  the  gain  and  the 
residual. 

Consider  the  stochastic  case  where  the  nonlinear 
system  of  (2.1 1  and  1 2.2)  with  additive  noise  becomes 

x,  =  t,  _  |(  v,  - 1 )  +  <■),  .  i  (31 1 

r,  =  +  v,  =  z*  +  r,  (3.2) 

where  again  /(•)  and  /i(  • )  are  assumed  to  be 
modifiable  functions  and  where  '(m,|  and  * v, *,  are 
zero-mean  independent  noise  sequences  with  finite 
second  moments  Q,  and  respectively.  It  is  further 
assumed  that  the  hJjS  and  r,s  are  independent.  Based 
on  Remark  1.  the  structure  of  the  MGEKF  for  the 
above  system  is  similar  to  that  of  the  MGEKO  in 
Section  2  except  that  the  gain  algorithm  is  altered. 
Furthermore,  since  -*  in  (3.2)  is  not  available 
because  of  the  measurement  noise  v„  z*  is  replaced 
by  r,  in  the  gain  formulation.  If  /i,(  )  and  /,(■)  are 
modifiable  nonlinear  functions  and  /i,()  is  differen¬ 
tiable.  then  the  algorithm  of  the  MGEKF  is 
summarized  as 

x,  =  i ( v,  _ , )  (3.3) 

x,  =  x,  +  -  /i,(  v, ))  (3.4) 

ill,  =  '/,,(-  |.x,  ,)p,  ,  (r,  ,.x,  .,  )r  +  Q, 

(3.5) 

k,  ~  -*-  ;,)  1  (3.6) 

p,  =  (/  -  ly//,|:,..v,llm,l/  -  Ai.  '/.lr,.  v, ) ) 7  +-  A,/,A,' . 

(3.7) 


Remark  2.  Note  that  if  the  predicted  measurement 
6,1  x,)  is  used  in  (2.16)  rather  than  then  the  gam  m 
(2.16)  reduces  to  that  of  ( 3.6 1  In  this  wav.  the  cam  of 


the  MGEKF  is  in  the  form  of  the  EKF  which 
ensures  that  the  gam  is  only  a  function  of  past 
measurements.  Since  the  stochastic  estimator  ( 3.4 1 
can  be  rewritten  in  a  modifiable  form  without  any 
approximation  as 

v,  -  v,  x  C,(x,  -  x,)  +  A,v,  ( 3 . S  | 

where  C  -  A,  0',iz*.  v,l.  the  error  equation  produced 
from  1 3. Si.  (3  1  |  and  (3.3)  are  exact.  Although  C,  is 
not  implementable.  the  form  of  (3.8)  is  important  for 
our  analysis  of  the  behavior  of  the  algorithm. 
Furthermore,  it  is  critical  to  the  MGEKF 
performance  that  '>,  in  (3.7  (be  calculated  using  z,.  In 
contrast,  the  I  KE  algorithm  (Ja/winski.  I970| 
calculates  V  using  it, c  v, )  in  (3.7). 

3.1.  Stability  (Dialysis  of  the  intermediate  MGLK / 
In  order  to  facilitate  the  stochastic  stability 
analysis  of  the  MGEKF.  as  a  first  step  we  employ  an 
unrealizable  estimator,  which  uses  z*  to  calculate 
the  gain  A,.  Although  this  scheme  is  not  imple¬ 
mentable  for  the  stochastic  case,  it  forms  a  nominal 
to  which  the  implementable  filter  is  compared.  This 
estimator,  called  "the  intermediate  MGEKF"  for 
convenience,  is  given  by  the  following  algorithm. 


A*  ,(X*  ,) 

(3.9) 

x*  =  x,*  +  A,*(r,  - 

/l,(X,*)l 

(3.101 

»i*  =  a/,  -  ,(r*  ,.x 

+  Q,  - 1 

,(--*. 

Y* 

1.  A,  .  t  1 

(3.11) 

A*  =  »i*(/i*  )'((/)* 

)Hi*(/i* )'  +  y,)  1 

(3.12) 

P*  =  (/  -  7*4,1 

+  A:*y,A*' 

.  x*  »hi,*(  /  -  A, *4, 

(--,*.x,*))r 

(3.13) 

where  the  superscript  *  is  to  distinguish  the 
estimates  of  the  intermediate  MGEKF  from  those 
of  the  MGEKF".  The  essential  change  in  forming  the 
intermediate  MGEKF  is  that  z*  rather  than  r,  is 
used  to  calculate  hi,  and  p,  in  (3.5)  and  (3.7). 

We  consider  the  stability  of  the  intermediate 
MGEKF  by  using  Lyapunov's  second  method  in 
the  probabilistic  Hilbert  space  L->. 

Before  proceeding  further,  the  following 
definition  is  introduced. 

Definition  2  (Tarn  and  Rasis.  1976).  A  discrete 
stochastic  process  x,  is  said  to  be  exponentially 
bounded  in  mean  square  with  exponent  0,  if  there 
existsconstantst)  <  n  <  I.K,  >  O.andKA  >  Osuch 
that 

!!  \ •  K  .Nil  -  oi  (3  I  4 1 

ll.v.ll '  in  (3  14)  is  defined  in  the  probabilistic  Hilbert 
space  /.  •  such  ih.it 

|| X, =  I  x/  v,-(  X,  )d  X,  (3  15) 
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where  rrt  \, )  is  the  probability  density  function  of  v,. 

The  errors  in  the  estimates  of  the  intermediate 
MGEKF  can  be  written  from  (3.1 ).  (3.2).  (3.9)  and 
(3.10)  as 

e*  =  v,  -  x*  =  A  .,(-•-  i..v*. ,  )e*_  ,  +  , 

(3.16) 

and 


i.e. 

-i  =  //.v,  +  v,  (3.21 ) 

Assumptions  2.  3  and  4  can  be  relaxed.  Instead  an 
observability  assumption  is  used.  Relaxing  Assump¬ 
tion  3  is  important  when  applying  the  MGEKF  to 
the  parameter  identification  problem.  The  algor¬ 
ithm  of  the  intermediate  MGEKF  for  this  case  can 
be  written  as 


=  v,  -  v,*  =  (  /  -  KTW.x*)feT  -  kfv, 

=  L*:/,  .  ,(-*.  , .  A  *_  ,  )C*.  ,  -  k*Y,  +  [*<■>,-  , 

(3.17) 

where  L*  is  defined  as  L*  s  /  —  :*.x*l  and 

where  ( 3. 1 6 1  is  introduced. 

Lyapunov  functions  for  e*  and  e*  are  in  the  form 
of 

Lie*)  =  e*Tp*  'eT  (3.18) 

and 

r,(c*)  =  c*rm*  -'«?*.  (3.19) 

Before  staling  Theorem  1.  the  following  assump¬ 
tions  arc  needed. 

Assumption  1:  .q/,-(-,*..v*)  of  (3.16)  is  uniformly 
bounded  and  invertible. 

Assumption  2:  L*  in  (3.17)  is  invertible  for  all  ieZ 
Assumption  3:  Q,  is  uniformly  bounded  from  below 
such  that  Q,  >  a  •  /  >  0  for  all  i e  Z , . 

Assumption  4:  p*  “ 1  in  (3.18)  is  bounded  from  below 
by  a  constant  matrix  i  I  for  all  ieZ*  such  that 

II Kit'.* ill  =  s  cikfii2.  (3.20) 

Remark  2.  Assumptions  1  and  3  are  not  terribly 
restrictive.  Note  that  for  the  MGEKO  described  in 


*  _ 

1  -  1  — 

/,( .V,*) 

(3.22) 

X?  = 

x*  +  I 

k*(r,  -  //,.v,*l 

(3.23) 

m*  = 

A-.( 

-i..v*-  ,)p*- ,  A  i ( 

1-  V,*  |  )' 

+  Q, 

1 

(3.24) 

k?  = 

m*ll! 

{H,m*H!  +  v. )  ' 

(3.251 

P*  = 

(/  -  k 

(3.26) 

For 

the 

linear  measurement 

case,  since 

H,  =  <i,  =  /i,,.  the  update  formula  for  p*  of  the 
intermediate  MGEKF  is  essentially  the  same  as  that 
of  the  Kalman  filter.  Therefore,  positive  definite¬ 
ness  of  Q,  in  Theorem  1  can  he  relaxed  to  show  the 
global  stability  ofthe intermediate  MGEKF.  With  a 
small  modification  of  the  method  suggested  by 
Moore  and  Anderson  (1980).  the  invertibility  of 
.:/,(-*.  v*)  can  be  relaxed  to  prove  the  stability  of  the 
intermediate  MGEKF'.  However,  here  the  in¬ 
vertibility  of  .V1(r*..v1*)  is  kept  to  develop  Theorem 
2 

An  important  aspect  of  the  case  of  linear 
measurements  is  that  for  k*  =  />*//,'/,  1  the 
following  inequality  holds  for  every  Q,  >  0  and 
nonsingular  .rV,  (see  the  Appendix  in  Moore  and 
Anderson.  1980.  for  a  more  general  case) 

#>,*-7  -  V.r-  xLVpT'  'W  -, 

>  ,L*r ///y,  '//,/,*  |  (3.27) 


Section  2.  the  uniform  observability  of  (  c/,.^)  is 
sufficient  to  guarantee  that  L,  defined  in  (2.15),  is 
invertible  and  that  p,  1  is  uniformly  bounded  from 
below.  The  corresponding  conditions  on  L*  and  p,  1 
in  Assumptions  2  and  4  arc  not  unreasonable, 
although  there  is  no  such  simple  a  sufficient 
condition  that  can  be  checked. 

Theorem  1  The  errors  in  the  estimates  of  the 
intermediate  MGEKF  for  the  system  of  Equations 
(3.1 )  and  1 3.2)  are  exponentially  bounded  in  mean 
square  with  exponent  .)  under  Assumptions  1.  2.  3 
and  4 

Proof.  See  Appendix  I. 

The  objective  of  the  following  is  to  show  that 
w  hen  the  measurement  equations  of  (3.2)  is  linear  in 


where  L*  =  (/ —  k*H,|.  Moreover,  if  (:/,.//,)  is 
uniformly  observable,  p*  ~  1  is  uniformly  bounded 
from  below.  Therefore,  Assumptions  2.  3  and  4  are 
not  needed  in  the  proof  of  the  exponential 
boundedness  of  the  errors  for  the  intermediate 
MGEKF 

Note  that  the  Riccati  equation  for  the  in¬ 
termediate  MGEKF  can  also  be  obtained  from 
minimizing  A 


=  v'  n*  ■ 1 


v„  +  V  v/  ’//,v, 


+  «•/<).  'w,  (3.281 

subject  to  v,.  ,  -  V,v,  -t-  h, sv,.  The  minimum  value 
ofA  is  x{p*  1  \  v  Note  that  Q ,  of  (3  2 s I  is  related  to 
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Q,  m  the  algorithm  of  the  intermediate  MGI  KF  as 
Q,  =  h.Q,h! .  and  Q,  can  he  factored  as  0  (J,  0  1 . 

Lemma  1 

Q  '  p*  'Q  is  uniformly  hounded  for  all  V  £  /  . 

Proof.  If  one  chooses  \  =  Q  i  where  i  t  then 
vc£  Range  |  U  ).  where  Range  (III  denotes  the  range 
space  of  the  controllability  Gramian  11  Since 
Range  (It)  3  Range  i(Jl.  \  is  a  controllable  state. 
Therefore,  from  the  fact  that  the  minimum  cost  of 
(3.28 1  is  finite  for  any  controllable  states.  \ ' 1  \ 

=  y'Q  'p*  'Q  \  <  M  <  /.  for  all  \  ■;  /  .  and  for 
all  ye 

Belore  stating  Theorem  2  consider  the  following 
observability  assumption. 

Assumption  5:  For  every  A.  and  some  Y  >  0.  there 
exists  a  fi  >  0.  such  that 

k  -  \ 

Sk-s  k  =  I  0[kH[H0u  S  I!  ■  I  >  0  (3.29i 


where  <f>lk  =  FI  7r  (Note  that  Assumption  5 
implies  Assumption  2.) 

Theorem  2.  If  the  measurement  equation  is  linear  as 
(3.21 ).  Assumptions  1  and  5  are  satisfied,  and  the  u  ,s 
and  lys  have  finite  second  and  fourth  moments,  then 
the  errors  in  the  estimates  of  the  intermediate 
MGEKF  are  exponentially  bounded  in  mean 
square  with  exponent  0  for  positive  semidetinite  Q,. 

Proof.  See  Appendix  1. 

3.2  Stability  analysis  of  the  MGEKF 
So  far  the  exponential  boundedness  only  of  the 
intermediate  MGEKF  has  been  proved.  The 
objective  of  this  section  is  to  obtain  sufficient 
conditions  for  the  MGEKF  to  be  exponentially 
stable  by  comparing  the  estimates  of  the  MGEKF 
with  those  of  the  exponentially  stable  intermediate 
MGEKF.  Such  conditions  are  found  again  b\  usinsi 
Lyapunov's  second  method.  In  this  way.  the 
conditions  for  the  deviation  from  the  nominal  to 
belong  to  the  set  of  nondestabili/ing  deviations 
(Safonov.  19. .0)  are  obtained. 

The  errors  in  the  estimates  of  the  MGEKF  can  be 
written  from  (3.1 ).  (3.2 1.  1 3.3 1  and  1 3.4 1  as 

e,  =  v,  -  v,  1/  -  L",( -,*■  v,Uc,  -  k,\,  (3  30) 


'  C  1  *  -  1  •  '  1  1  l<  ,  | 


»  »  >  J 


/'A 


The  only  difference  between  ( 3.30 1  and  1 3  3 1 )  of  the 
MGEKF.  and  (3.1 61  and  (3.1 7)  of  the  intermediate 
MGEKF.  results  from  the  algorithm  for  the 
calculation  of  the  gam  filter.  That  is.  the  MGEKF 
uses  instead  of  j*  in  the  gain  algorithm  Therefore. 
Y,  1  =  1 1 ,.  1 ... . .  v,  , ;  contributes  to  the  difference 
in  the  gain  calculation  and  consequently  Y  , 
allects  the  estimates.  7,  ]  and  '/,.  Since 
C  ’  =  A,' \f )  is  evaluated  by  the  inter¬ 
mediate  MGEKF.  then  it  is  convenient  to 
write  C,  =  k,  v,  1  of  the  MGEKF  as  used 
in  1 3  30 1  as  the  sum  of  C*  and  the  perturbation 
AC  as 

AC,  1 3.32  I 

where  \C,  is  dependent  on  r,*  and  Y,  ,.  Similarly. 
7,  itr,“.  v,  ,  1  and  A,  ol  the  MGEKF’  is  defined  as 

1 1  -,  ) .  v,  1 1  =  7,  1  ( -  *  i .  v*.  ] )  4-  A.7,  _  j. 


k,  =  k*  4-  AA,.  (3.34) 

Consider  the  following  sufficiency  theorem  which 
states  the  conditions  for  the  deviations  AC,  and 
A  7.  ,  to  belong  to  the  set  of  nondestabilizing 
deviations  such  that  the  errors  in  the  estimates  of  the 
MGEKF  are  exponentially  bounded. 

Theorem  3.  If  AA,  in  (3.34)  is  bounded  and  AC',  in 
(3.32)  and  A 7,  ,  in  (3.33)  belong  to  the  set  of 
nondestabilizing  deviations  such  that 

(/  -  C,*  -  AC,)r/>*  '(/  -  C*  -  AC,)  -  <  1  <  0 


C/,  ..a,*  ,) 

+  A7  ,-,)V  ‘(7,  1  ( | .  v* .  j  | 

+  A. 7,  1 )  -  p*.  <  0  (3.36) 

where  p*.  mf  and  />,*  ,  are  quantities  from  the 
intermediate  MGEKF,  then  under  the  Assumptions 
I.  2.  3  and  4  for  the  system  of  (3.1 1  and  (3.2).  or  the 
Assumptions  I  and  5  for  the  system  of  (3.1 )  and 
(3.21  l.  the  errors  in  the  estimates  of  the  MGEKF  are 
exponentially  bounded  in  mean  square  with 
exponent  a. 

Proof.  See  Appendix  1. 

Remark  4  conditions  (3.35)  and  (3  36)  can  be 
combined  as 

(7,  ,  +  a 7,  ,)'(/.:  -  ac.iV:  1 

(/*  -  AC, II  7,  ,  a-  A  7,  ,  1 

-r.*i'  <0  (  3-37) 
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where  V,  - , .  L*  =  /  -  fc*  '',( xT  I.  P*  and  /»,*, ,  are 
quantities  from  the  intermediate  MGEKF.  If  the 
measurement  equation  is  linear  in  v,  as  (3.21). 

=  //,  and  AC,  =  Ak,ll,  are  used  in  the  above 
inequality . 

Remark  5.  Although  the  global  sufficiency  con¬ 
ditions  (3.35)  and  (3.36)  or  (3.37)  are  uncheckable 
analytically,  they  can  be  used  as  a  guide  for 
engineering  evaluation.  For  example,  a  local  test 
may  be  constructed  for  a  given  initial  state  and  state 
estimate.  The  MGEKF  and  the  intermediate 
MGEKF  algorithms  can  be  run  for  the  same 
ensemble  of  measurement  and  process  noise 
sequences.  For  each  sequence  the  deviations  of  A k, 
and  AC,  can  be  calculated  and  the  boundedness  of 
A k,  and  the  stability  of  the  MGEKF  v  ia  (3.35)  and 
(3.36)  or  (3.37)  can  be  assessed. 


4  MODIFIABILITY  Of  THE  SYSTHM  DYNAMICS  OF 
THE  PARAMETER  IDENTIFICATION  PROBLEM 
Consider  the  follow  me  scalar  dy  namic  model 


=  /#.«,  - 1 

+  ■  •  •  +  /fn".  -  n  (4.1) 


where  n  is  assumed  known  and  minimal,  and  iti  - 1  is 
a  known  scalar  input  at  time  i  -  1.  The  x;s  and  /i,s 
are  the  constant  parameters  to  be  identified.  The 
observable  canonical  form  (Chen.  1470)  of  (4.1 )  is 
obtained  in  the  state  space  as 

y,  =  4y,  _ ,  +  Bit,  .  , 

(4.2) 

=  Cy, 

where  r,  =  [>  ,.>: . y„ ][  and  where  1.  B  and  C 

satisfy 


B=  IL 


C'  =  o 


0.  0  .  .  1. 


If  unknown  x,s  and  //,s  are  augmented  to  the 
original  state  space  i".  the  augmented  vector  v  in 


x!  =  [v,  ■  v„.  x„...  x,. /(„... /I,  ],.  (4.4) 

where  x,  satisfies  a  nonlinear  dynamic  system 
expressed  in  (2.1)  but  w  ith  the  addition  of  a  known 
input  a,. 

It  is  shown  below  that  /()  corresponding  to  (4.2). 
manipulated  in  the  augmented  state  space  is 


modifiable.  Consider 

•4y,  i  +  Bit,  i  —  .4,  i  y,  i  —  B,  \  a,  i 

=  ii  i ( y,  i  —  y,  i  i  +  (  t  —  i,  i  >y,  i 

+  I B  —  B ,  ]  lit,  |  (4.5) 


(.4  -  .4.  .,)>•..  ,  = 


x,  x„ 

■  x„  I  +  x„  - , 


.-X,  -r  X, 


Therefore.  -  [  in  (2.12)  for  this  ease  can  be  written 


■rft-  I.  Yi- l) 


■4,-|  —  Z*-  |  I rt  It,  -  \ 


where  is  the  q  x  q  identity  matrix 
5.  APPLICATIONS 

5.1  Relation  to  Ljung' s  modification  and 
con  verge  nee  analysis 

Asymptotic  behavior  of  the  predicted  state  EKF 
as  a  parameter  estimator  for  linear  systems  is 
analyzed  in  Ljung  (1979).  In  Ljung  (1979).  the 
estimate  (1,  of  the  parameter  0  is  sought  for  the 
system 

v,-*  i  =  A((l)y,  +  w,  (5.1) 

r,  =  C(0)y,  +-  v,  (5.2) 

where  to,  and  v,  are  zero-mean  white  noise  with  finite 
second  moments  Q,  and  respectively.  The 
structure  of  the  predicted  slate  EKF,  which  is  an 
example  of  the  recursive  prediction  error  (RPE) 
algorithm  (Moore  and  Weiss.  1979),  is 

v. - 1  =  ./,(  v,l  +  KA:,  -  /i,(.v,n  (5.3) 

where  the  estimate  of  the  augmented  state 
x!  =  [y!  .<V  ]  If  the  derivative  of /,(  (with  respect  to 
.v,  is  defined  as 


el  As,) 


r.4(d,)f, 


■\A.  M, 
0  / 


and  the  derivative  of/i,(  )  with  respect  to  v,  is  defined 


-  fort,,  ‘  [c,:  o, 

t  \,  l  <  o,  j 
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and  £|v,cj'  |  =  0  for  all  i. /.  then  the  gain  K ,  of  the 
predicted  state  F.KF  of  |5.3)  is  obtained  from  the 
following  algorithm. 


K,  =  [A, Pi ,C,r  -r  \1,P[C!  +  A,P2D! 
+  .Y/./ND,' }S~ 1 

s,  =  c,/5,  C,r  +  C.Pj  D[  +  D,P[Cl 
+  D.PyDj  +  7i 

l,  =  [p;,c,r  +  p$pi]s~ 1 

-4.P,  .4/  +  A,plt\l!  +  U,P:.A! 
+  M,PhM[  -  ~K,S,K[  +  Q , 
Pz..,=  A,P2 ,  +  M, Pi,  -  K.-S.-tf 
Pj..,  =  Pj.  - 


where  the  partition  of  covariance  matrix  P,  of  the 
state  v, 


,  =p\  p'- 

■  [pf  Pj 


is  used  in  the  algorithm  and  the  initial  value  of  the  P, 
matrix  is  Pt,  =  diag  [P,„,  P3J. 


5.1.1  Ljungs  modified  EKF.  As  shown  in  Ljung 
(1979).  the  predicted  state  EKF  is  not  asymp¬ 
totically  stable.  This  fact  is  analyzed  from  the 
stability  of  the  ordinary  differential  equations 
obtained  by  decoupling  the  estimates  and  co- 
variance  of  the  state  of  (5.3)  under  the  assumption 
that  the  stationary  (or  equilibrium)  value  of  0  is 
obtained.  However,  the  stability  domain,  defined 
such  that  the  estimates  of  the  system  are 
exponentially  stable,  is  not  explicitly  specified  in 
Ljung  (1979).  In  Ljung  (1979)  the  estimator  of  (5.3) 
is  also  modified  to  be  asymptotically  stable  at  the 
stationary  point.  This  modification  includes  the 
change  in  the  gain  K,  (see  Theorems  7.1  and  8.1.  and 
Corollaries  7.1  and  8.1  of  Ljung.  1979)  such  that 


K,  =  [,l,P,.C,r  +  UrPtC!  +  A,P2iDJ 

+  m*p2,d[  ]sr 1 


•V/,*  =  M,  +  -  C.F.) 

(  0 


where  K  and  0  are  stationary  values  of  K,  and  0,. 
respectively,  and  <  K  <~fl  is  obtained  from  an 
approximated  algorithm  (Equation  (7.8)  of  Ljung. 
1979).  Since  the  modified  gam  K,  of  (5.6)  is  a 
function  of  the  present  measurements,  biased 
estimates  are  expected  during  the  transient  period 
where  P2,  is  nonzero  (Song  and  Speyer.  1985: 
Aidala  and  Nardon.  1982).  For  example,  in 
Westlund  and  Tysso  (1980)  and  Ursin  (1980).  a 


simple  example  from  Ljung  (1979).  i.e. 


y, .  i  =  0,y,  +  o , 


«...  =0, 


-i  =  y,  +  vi- 


is  used  to  show  how  the  filtered  state  of  the  FK  F  (the 
usual  EKF)  has  asymptotic  convergence  charac¬ 
teristics  by  using  the  same  method  of  Ljung  (1979). 
However,  if  Ljung’s  modification  (5.6)  is  applied  to 
the  above  example.  M*  in  (5.7)  becomes 


MT  =  f.  +  n  -  v.) 

P i,  +  7, 


which  can  be  written  as  A/*  =  v,  +  ( —  c,)  =  if 
Pi.  >  Vr 

In  order  to  avoid  biases  in  the  estimates,  the 
estimator  of  (5.3 1  with  the  modified  gain  K,  of  (5.6)  is 
changed  to  the  form  of  the  filtered  state  EKF.  i.e. 


-v.  =./•  -  i(Xi-i) 


V,  =  v,  +  k,i-i  —  Ffx'i). 


where  x  =  [y.0]T.  and  //  =  [1.0].  For  the  filtered 
state  EKF.  Ljung  s  modification  would  turn  out  to 
be  the  algorithm  of  the  .V1GEKF.  at  least  for  the 
transient  period  which  is  critical  to  the  stability  of 
the  estimates. 

Note  that  /  (•)  of  the  above  example  is  modifiable 
such  that  c/,  in  (2.12)  for  the  system  of  (5.8)  can  be 


written  as 


In  order  to  calculate  the  gain  of  the  MGEKF. 


V.-Ci.x,-)-  Q  ^ 


is  used,  while 


<7.  =  f  .V.' 

rxj  0  1 


is  used  in  the  EKF  to  calculate  the  gain.  Since  the 
original  system  of  (5.8)  is  linear.  (5.9).  (5.10)  and 
(5.11)  are  similar  in  form.  Since  the  intermediate 
MGEKF.  which  uses  .V,  in  (5.9)  to  calculate  the 
gain,  is  globally  stable  in  the  mean  square  sense  by 
Theorems  I  and  2  of  Section  3.  the  exponential 
boundedness  of  the  EKF  for  this  example  can  be 
obtained  by  a  procedure  similar  to  that  used  to 
prove  Theorem  3.  The  result  is  that  the  EKF  for  this 
example  is  exponentially  bounded  if  A^,.  which  is 


X1 

tty  v 


i 


the  difference  between  k*  of  the  intermediate 
MGEKF  and  k,  of  the  EKF.  is  bounded  and  if  the 
inequality 


mit  =  [»ij  —  m:s  7/iij,  -| 


U  7 

1 5.::  i 


<7.-7r 

r.v,  _ , 


!/  -  k,H)rrri\i  -  m/> 


-  pr- 1 


<7.  , 

('.V,  _  , 


(5.12) 


is  satisfied  where  p*  and  />*_ ,  are  calculated  from  the 
intermediate  MGEKF.  and  t'J,  - ,  c. v,_  t  and  At,-  are 
evaluated  from  the  estimates  of  the  EKF.  If  the 
above  conditions  are  always  satisfied,  then  the  EKF 
for  this  case  is  globally  stable.  Otherwise,  the  regions 
of  satisfaction  of  (5.12)  provide  the  stability  domain 
for  the  filtered  state  EKF.  If  the  conditions  of 
Theorem  3  are  satisfied,  the  errors  in  the  estimates  of 
the  MGEKF  are  exponentially  bounded  in  the 
mean  square  and  the  stationary  value  of  the 
estimate  of  parameter  (>  is  obtained  in  that  sense. 

5.1.2  Convergence  analysis  of  the  MGEKF.  The 
follow  ing  is  a  convergence  analy  sis  of  the  MGEKF 
for  the  scalar  system  of  1 5.81.  A  convergence  analysis 
of  the  predicted  state  EKF  for  the  same  sy  stem  was 
given  originally  by  Ljung  (1979).  and  similar 
analyses  of  the  filtered  state  EKF  are  found  in 
Westlund  and  Tysso  (1980)  and  Ursin  (1980). 
Satisfaction  of  the  regularity  conditions  necessary 
for  the  proof  of  convergence  is  relegated  to 
Appendix  I  of  Ljung  (1979|.  After  some  manipu¬ 
lation.  the  MGEKF  algorithm  which  estimates  the 
augmented  state  x,  of  the  system  (5.8)  yields 


The  objective  is  to  demonstrate  the  local  con¬ 
vergence  properties  of  the  MGEKF  using  the 
linearized  differential  equation  of  Ljung  (1979) 
about  the  stationary  point. 

It  can  be  shown  that  in:  and  m ,  tend  to  be  zero  as 
i— *  x  (Ljung.  1979:  Moore  and  Weiss.  1979). 
Therefore,  for  Q  =  -/  =  1  as  /— *  x,  ni,i—*nil  and 
(5.20)  reduces  to 

in |  =  f):(in,  —  ihjliii,  +  1 )~  1 )  +  1.  (5.23) 

This  implies  that 

(V-  +  .  O4  +  4 

m,  = - -v -  (5.24) 

where  d  is  the  stationary  value  of  the  estimate  of  the 
parameter  obtained  under  the  assumption  that  the 
conditions  of  Theorem  3  are  satisfied.  Therefore.  O', 
which  is  the  limit  value  of  G,  as  i  — »  x.  satisfies  using 
(5.17) 


G  = 


in ,  +  1 

By  using  (5.22)  in  (5.21 ).  / h2,  satisfies 

///.>,.,  -  (d,  -  0,G,)inli  +  :imiL . , 

The  process  IF  satisfying 


y,  =  Pi  +  G,(r,  -  y.) 

(5.13) 

Pi  =  d|  -  1  Pi  ~  1 

(5.14) 

0,  =  0,  -f-  L,(Z{  —  y, ) 

(5.15) 

d,  =  (V, 

(5.16) 

G,  =  »!,,(»),,  +  •/,)*  '  =  /111, if  1 

(5.17) 

L,  =  »n2j(m i ,  -+■  •/,)” 

(5.18) 

U, 


=  id  -  OG) It,  + 


(5.25) 


(5.26) 


(5.27i 


is  related  to  the  ni:  process  giv  en  by  (5.26)  as  /  — >  x 
with  the  assumption  of  constant  ih}  (Ursin.  1980). 
i.e. 


ih2l  =  [ll/fo  3, 


(5.28) 


The  matrix  in,  is  partitioned  as 


7 

w»|, 

m  2, 

J  »/.,  = 

m2i 

'»3,_ 

where  iih  =  ini ^  and  in,  =  iinx  .  From  (5.15)  and 
(5.16) 


0, .  t  -  0,  =  ;  L,i:, 

i 


where  L,  =  //.,  and  —  z,  —  t\- 1  f,  -  i-  I  rom  the 
assumption  of  ergodicity  of  the  processes  and  the 
relation 


L,  =  llhjll  !  +  1 )  '. 


the  RHS  of  the  differential  equation  associated  w  ith 
0  at  the  stationary  point  in  Ljung  (1979)  will  be 


(5.31) 
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Asshownin  Ljung  ( 1979).  the  sign  ol'Ej M7:|  decides 
the  stability  of  A  since  ihqiii,  t  1 )~ 1  >0. 

in  order  to  calculate  E[  1(7: |  analytically,  transfer 
functions  for  IF  r  and  b  :  are  needed.  From  (5.27) 
IF  r  satisfies 


=  E\  H7:[  where 


H 


«.!</)  = 


1  -  (A  -  K )</ 


(5.321 


where  q " 1  is  a  one-step  delay  operator  and  K  =  OG. 
Note  that  using  the  predicted  state  EKF  in  Ljung 
(1979).  IF  :  is  KH,{q):  and  for  the  filtered  state 
EKF  fl  :  is  G'i/F/;((/);.  From  the  relation  (limiting 
equation  for  (5.13)  and  (5.141) 

y,  -  fly,  -i  +  Glr,  —  fly,  - , ), 
y  :  is  obtained  as 

v  G 


1  -  (fl  -  Khi  ' 


Therefore. 


-  =  H:U,)  = 


1  -  ft,"' 


1  -  (fl  -  £)</-r 
Finally.  EJlTiij  is  obtained  from 

1 


(5.34) 


(5.35) 


Jm  =  £!»cj  =  r^.d)W,(l/-|)//:W)Cl>,.-(</)  - 


,  fl'/ 

9 

(5.36) 

where  c  is  the  unit  circle  in  the  </  domain,  and  where 

^r:(9)  =  1  +  2,  [1  —  9^0 H *  —  flo/9 ) ]  (5.37) 

where  0o  is  the  actual  value  of  the  system  parameter 
and  A  is  the  actual  process  noise  variance,  while  unit 
variance  is  used  in  the  gain  calculation  of  the  filter. 
Note  that  (5.37)  is  derived  from  (5.8)  with  y  =  1  and 
the  actual  process  noise  variance  A.  After  evaluation 
using  residue  calculus.  /„  yields 


where 


Jm,  = 


Jm:  - 


jm  jm\  + 


A(fl„  -  fl)fl„ 


(5.38) 


(I  -  /fl„)(fl„  -  I Kl  -  flr,)' 

(./  -  fl)[(i  - M,M/  -  9„)  +  /■/'] 

n  -  / 2 7(1  —  }  fl„ )  ( /  -  (f„)  ' 


(5.39) 


where  /  =  fl  —  N  -All  +  fl‘  2  +  v  fl4  4  +  1 ).  f„ 
can  be  compared  to  the  results  of  the  filtered  slate 
EKF  of  Ursin  ( 19S0)  where  of  the  EKF  satisfies  le 


=  Gfm,  (1  -  /fl„). 
/,,  =  Gfm,  (1  -./’). 


(5.40) 


Figures  1  and  2  show  the  comparison  of  l„  and  fe 
for  0t)  =  0.2,  0.4.  0.6  and  0.8  and  for  /.  =  1  and 
A  --  10.  The  results  show  that  there  is  only  one  zero 
crossing  for  both  (he  filtered  state  EKF'  and  the 
MGEKF  indicating  that  there  is  only  one  point  to 
which  the  estimator  will  converge.  Note  that  for 
A  =  10  for  mismatched  process  noise  variance,  the 
parameter  estimate  of  the  MGEKF  is  only  slightly 
more  biased  than  that  of  the  filtered  state  EKF. 
Although  the  figures  show  that  |  / J  >  |  /t.|  and 
|d/,„  dfl|  >  |d/,.  dfl|.  this  does  not  indicate  that  the 
MGEKF  has  faster  convergence  that  the  EKF.  as 
would  be  supposed  by  the  analysis  of  Ursin  (1980). 
The  local  convergence  of  the  estimators  is  given  by 
evaluating  E[U: ]  in  (5.31)  which  includes  nij  and 
not  just  by  )„  or  ie.  Although  it  can  be  argued  that 
near  the  equilibrium  point  my  is  a  constant,  its  value 
is  different  for  each  of  the  filters  and  is  obtained  from 
Ljung  (1979) 

mj 1  =  G(fl)  =  E[lT'V‘] 

=  r1-  ®  H(i i ' 1  )s“'  HUy )5t>.:(9) —  (5.41 ) 

2 *J  J,  9 

where  H(q)  is  given  by  W,  in  (5.32)  for  the  MGEKF. 
by  GcjH ,{q)2  for  the  filtered  state  EKF.  and  by 
OGH^q)1  for  the  predicted  state  EKF.  All  three 
filters  may  have  similar  asymptotic  rates  of 
convergence  since  of  the  MGEKF  is  less  than 
that  of  the  filtered  state  EKF.  which  in  turn  is  less 
than  that  of  the  predicted  state  EKF.  If  m3  is 
asymptotically  the  estimation  variance,  then  this 
indicates  that  the  MGEKF  is  locally,  relatively 
more  efficient  than  the  other  EKFs. 

5.2  Pole  identification 

Consider  the  following  example  ol  a  discrete-time 
single-input,  single-output  linear  time  invariant 
system  excerpted  from  Saridis  (1974) 


C,  +  l  —  -4  c,  +  d  w, 
-I  =  Gc,  4-  1'j 


(5.42) 


where 


'o 

1 

0 

o' 

0 

V 

0 

0 

1 

0 

1 

.  c7  = 

0 

A  = 

0 

0 

0 

1 

.  d  =  \ 

0 

0 

a  i 

a  2 

a., 

a4 

Ij 

0 

(5.43) 


Fig.  I  The  functions  and  /„l0)  for  il0  =  0.2.  0.4.  0.6  and  O  X  with 


and  the  noises  are  zero-mean  white  Gaussian  with 
finite  second  moments  such  as 

=  <\j 

E{v,y,\  =  0.25<>„. 

According  to  Saridis  (1974).  the  EKF  used  to 
identify  the  a,s.y  =  1  —  4  performs  poorly  even  for 
the  case  w  here  it  is  stable.  A  direct  application  of  the 
MG  EKF  to  the  above  system  is  infeasible,  since  the 
nonlinear  system  dynamics  are  not  modifiable  when 
the  unknown  parameters  are  augmented  t  the 
original  state  Therefore,  the  stochastic  system 
(5.42)  is  first  transformed  into  the  input  output 
transfer  function  using  the  /-transform  technique 
as 

.-(</)  q2  -  iuq  +  (I  -  «,) 

- =  7 - r —  — 7 - .  (5.44) 

vv((/|  q  —  ti^q  -  u,<r  -  u2q  -  a, 


This  form  is  consistent  w , til n 4.:  ..  Tu-  artyer.', utile 
canonical  form  (Chen.  Er;0  ;s  reaiiz;*:! 

■v, -i  =  ;i.'.  -  m. 

</i4>.H 

where 

0  0  0  </,l  •;  To' 

i  o  0  </, 

,4  = 

0  I  0  <1, 

0  0  1  „4 

!t5.4fi| 

This  is  the  form  suggescJ  ur>  i-42i  .and  tui3i  'if 
unknown  parameters  u,  i~.  a*.  .and  ,i;u  lire 
augmented  to  the  originu  -autuso^v  iimiv^, 

respectively,  then  the  autunmiudlnoniimeur  -.v.s-uem 


<jV 


M 
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0  2  Lambda 


a  ■-  0  6  Lameda  ;  10  0 


r„  ;  0  8  Lam  sdc  -  10 


FlCi.  2.  The  functions  /,.((!(  and  l„{0)  for  l>„  =  0.2.  0  4.  0  6  and  0  8  with  /.  =  10 


dvnamics  are  written  as 


=  /,( v, )  4-  ll,(  v,.  »  ,  |.  ( 5.47 ) 

F  ollow  ini:  the  arguments  of  Section  4.  the 
measurement  2,  =  x4  +•  v,.  and  /(  1  of  |5.47|  are 


modiliahle.  Note  that  the  abo\e  nonlinear  system  is 
corrupted  by  state  dependent  noise  with  second 
moment 

q,  =  /•.'!  u;(.v,.m,iu/i \,.u.i;.  is  4si 

Since  (I  -  v-J  and  v,,  111  It,  of  ( 5  4 ~  1  are  actually 
unknown  values  for  this  problem.  0,  in  t?  4 s  1  is  Him 
assumed  to  be  an  arbitrary  constant  matrix  when 
the  \KiI  Kt  is  applied,  and  l...er,  an  adaptive 
scheme,  used  to  calculate  (J,.  is  implemented  in  the 
simulation.  The  stability  analysis  in  Section  3  is  still 
valid  even  though  the  noise  model  is  ditlerent  from 
the  actual  noise  as  long  as  the  actual  noise  and  noise 
model  are  /ero-mean  processes  with  linile  seci'nd 
moments.  As  eiven  in  Section  4.  /,!:*.  \ ,  1.  obtained 


<«,v 
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simulation  of  the  errors  in  the  parameter  estimates 
with  the  matched  process  noise  statistics,  while 
Fig. 4  shows  those  of  the  100",,  mismatched  cases 
where 


(1.669)2. 

-2.36  . 

1.669 

-2.36  , 

-1.414  0 

1.669  . 

-  1.414. 

1 

0 

0 

^<ma 


is  used.  For  all  of  the  simulations,  the  initial  value  of 
true  state  is  .v/,  =  [0.0. 0,0.  -0.66.0.78.  -0.18. 1.0]. 
the  initial  value  of  the  a  priori  estimation  of  the  state 
is  xl  =  [5.  5.  5,  5.  -  1.32.  1.56.  -0.36.  2],  and  the 
initial  value  of  p,  is  pn  =  10/s.  Finally,  for  the 
purpose  of  comparison  with  the  results  of  Saridis 
(1974),  Fig.  5  shows  the  convenience  rates  of  the 


MG  IK.  I  with  Q, .  and  the  adaptive 

calculations  of  Ot.  For  adaptive  calculations  of  Q, 
the  following  Qt  is  used 


U  -  a  - . 

-  .Vs I  1  -  V-  I. 
(I  —  V  - 1  . 


-  \sl  1  -  V  -  ). 

v; 


il  -  v-l 

-  As 


The  results  here  show  a  remarkable  improvement 
over  those  reported  in  Saridis  ill174i  for  the  FKF 
using  the  stochastic  system  (5.42 1.  The  average 
normalized  parameter  error  was  reduced  by  three 
orders  of  magnitude  of  the  FKF  of  Saridis  1 19~4i 
and.  in  fact,  the  result  of  the  MG  FKF  is  equivalent  to 
the  best  parameter  identification  scheme  reported 
here. 

With  the  same  arguments  as  those  tuven  in 


Iterot  ton  (x  fO  ) 


Iteration  (  x  /O') 


I terotton  (  %  (0  ) 


I  tera  r  ,on  (  *  10' ) 


Fig.  4  The  errors  in  the  estimates  of  the  parameters  in  the  case  of  mismatched  process  noise  variance  O 
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Fit.  5  Convergence  rates  of  the  MGEKF  with  Q„,.  Q,lm  .  and  adaptive  calculations  of  Q,. 


Section  5.1.  the  EKF  applied  to  observable 
canonical  form  1 5.45 1  may  have  global  stability  since 
the  difference  between  v, )  of  the  intermediate 
MGFKF  and  <'7,  f.v,  of  the  EKF  is  small  if.v4i  -  v4, 
for  the  EKF  is  small  such  that  (5.12)  in  Section  5.1 
holds. 

6.  CONCLUSIONS 

An  exponentially  convergent  observer  called  the 
MGEKO  is  derived  for  the  problem  where  both  the 
nonhnearities  in  the  d>  namtes  and  measurement  are 
assumed  modifiable.  The  exponential  convergence 
of  the  MGEKF  is  studied  in  the  probabilistic 
Hilbert  space  L2,  by  introducing  the  exponentially 
bounded  nominal  filter  called  the  intermediate 
MGEKF.  and  sufficient  conditions  for  the  MGEKF 
to  be  globally  stable  are  obtained  from  the  analysis. 
The  intermediate  MGEKF  can  also  serve  as  a 
nominal  filter  for  the  stability  analy  sis  of  the  filtered 
state  EKF  used  for  the  modifiable  systems.  These 
results  generalize  the  work  reported  in  Song  and 
Speyer  (1985)  for  the  case  where  only  the 
measurement  function  was  nonlinear  and  modi¬ 
fiable.  The  stochastic  stability  analysis  for  the 
continuous  time  MGEKF  is  found  in  Song  (1983). 
These  generalized  results  are  now  applicable  to  the 
parameter  identification  problem  where  the 
measurement  is  linear  and  the  dynamics  are 
nonlinear  and  modifiable,  if  the  proper  coordinate 
frame  is  chosen. 

Ljung's  conv  ergence  analysis  is  applied  to  a  scalar 
parameter  identification  problem.  The  results  show 
that  the  MGEKF'  has  similar  convergence  proper¬ 


ties  as  the  filtered  state  EKF  which  differs  markedly 
from  that  of  the  predicted  state  EKF  (Ljung.  1979). 
The  MGEKF  is  applied  to  a  pole  identification 
problem  of  a  fourth  order  system  where  the  previous 
reported  results  for  the  performance  of  the  EKF  are 
poor.  By  choosing  the  observable  canonical  form  of 
the  state,  the  resulting  system  dynamics  are 
modifiable  such  that  the  MGEKF  is  readily  applied. 
Monte  Carlo  simulation  indicates  that  for  the  case 
of  matched  noise  variance  and  for  the  case  of  the 
adaptive  calculations  of  the  noise  variance,  the 
MGEKF  has  excellent  convergence  characteristics. 
In  the  case  of  mismatched  noise  variance  without 
the  adaptive  feature,  the  MGEKF  has  decent  mean 
square  error  performance  characteristics. 
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APPENDIX  1 :  PROOFS  OF  THEOREMS  1.  2  AND  3 
I.  Prnol  of  Theorem  1 

Take  the  conditional  expectation  over  f;ie*)  -  Me,*!  given 

7;*  =  1VS.eT.er....e,*-1.f,*!  as 

Er* :  Fie; I  -  r;(e,*)!  =  e,*£,* ; L:tpE  1  IT,  -  m*  - ' 

+  tr  [E,*;A*'p.*  'k*y,|  ]  (A.!; 


over  1,1c, *1  -  l,  jit**  ,  I  gtsen  I  *  ,  N  T .  ,  as 

/,*  :f,i.',*i  -  i;  ,(,■*  ,i; 

=  »■,•' ,/•;,*  .:■/!  on*  1  /,  ,  -  /’,*  , 

-  tr  ,£,.*  .  ‘o’ C  'Q,  i ‘ .  t  \  5t 

where  use  is  made  of  the  facts  that  ,  is  not  a  function  of  <  i, 
v*  |  is  a  function  of  the  past  measurements  such  that  •/.  , 
=  :/,-|ir,*  |.  x*  1 1  and  hi*  aie  independent  of <•»,  ,.and'i,  ,  is 
a  zero-mean  independent  noise  process  wiili  finite  second 
moment  O.  The  term  inside  the  first  £,  *  ,  of  the  RIIS  of  (AN) 
satisfies 

■JI-O’C  ,  -r*  V  =  -r:  I'Vf'ii  Q, 

-V  V,  ‘.p*  ‘,1  '  -/,  3  -  (I  I.A.6I 

Equation  lAhl  implies  that  there  exists  (I  -  /(,  ,i.  <  I  such 

that 

c, ,<  ,  =  pz.cr'ip*  ,.  i a. 'i 

The  existence  of//,  is  assured  hy  Assumptions  3  and  4.  Therefore. 
(AN)  becomes 

£,,*  ,;r,if,*i  -  i.  .!>',*  ,t; 

=  K  ;  -  1 1  -  p-.lE,  ,1 ;  l,  ,(.',*  ,  f  lA  M 

where  K;,  =  tr  :E|*  ,',in*  '0,  , , and  0  <  K  ..  <  \  <  /.. 
Now,  take  the  conditional  expectation  over  I A .  4 1  for  given  ),• 
and  use  the  nesting  property  of  the  conditional  expectation,  then 

£,*  ,;i;ie:i  -  f,(e,*i:  =  £,*  ;i,ic,*i  -  r.t,* »; ; 

—  K  i ,  —  1 1  —  p  i  ,)£>*.  J  ( '/ , .  |  e*  i  —  c*i,  ~  1 1  in* 

,  .  |Cf.  ,  O),  | 

=  K -II  m, *  ;e,*.  , 

-II-  Pi.iKs,  (A.9) 

where  the  fact  that  a,.  ,  is  independent  of  .  ,  and  in*  is  used. 
From  (A.9|  and  (A.X) 

£r,*  ,:»;«*',*)  -  i;- , >: 

=  k i,  -  ii  - /<;,i£v*  ,;i,  ,(»-•. 1 1; 

-tl  -  , )e,*-r, £, *  ,  |  V . , in,*  ' 1  •  V, . , ! v*.  i 

4-pi.Kj.sKj  —  <),£,  *-  i !  1 1  - 1  ( 1 1 ,  I  A.  1 0 ) 


££[■)  is  a  conditional  expectation  operator  given  >',*.£*  is  defined 
in  (3.1 7 1.  and  tr  is  a  trace  operator  for  matrices.  The  derivation  of 
IA.1 1  uses  the  facts  that  r*  is  not  a  function  of  io„  v*  is  a  function 
of  the  past  measurements  such  that  I*  in  (3. 17|  and  p,*  are 
independent  of  v„  and  r,  is  a  zero-mean  independent  noise 
process  with  finite  second  moment  Note  that  the  term  inside 
the  first  £t*  operation  of  the  RHS  of  (A  1 1  satisfies  the  following 
equation 


L:'p;  -'IT,  -  m* 


r(  S.-Ol's* '  a.  *  s*  <  0  (A.2l 


where  v,*  of  the  intermediate  MGEKF  satisties 
v*  =  .  ,/s* '  /.*  1  m*  1  The  last  inequality  of  lA.2l  comes  from 
the  tact  that  s*  e  a* ' "  is  at  most  ol  rank  </.  Equation  (A. 2)  implies 
that  there  exists  0  <  />,.  <  I  such  that 


e*  ’IT,1  p*  •'/*£,*  =  p,.e,*,«i1*  1  o*  =  p,  f;<c,*l  |A.3) 


Therefore.  (  A  1 1  becomes 


£r*; i if.-,*)  -  life,*);  =  k,,  -  ii  -  p,.i£v*:f;ii-;t:  i a.4> 


where  K,  is  defined  as  K, .  =  tr  [E)*,,k*'p,*  'k,*,,!'.  and 
0  <  K  i  <  3/  <  r  Similarly,  take  the  conditional  expectation 


whercO  <  sup  (K,  +  pi  K*,\  <  K,  <  r. .  and  a,  =  1  -  such 
oi- 

that  0  <  //.  <  <i,  <  1  The  boundedness  of  K,  is  obtained  from 
Assumptions  3  and  4.  and  the  gain  k,*  algorithm  of  the 
intermediate  MGEKF.  Note  that  Assumption  3  implies  in*  1  Is 
uniformly  bounded  from  above. 

By  applying  the  nesting  property  of  the  conditional 
expectation  to  (A.I0I.  one  can  obtain 


£»*.:  !;(*■,* »;  =  £,*.:£,* , :  i  ,u-,*  t: : 

<  K,  ( i  -  a ,)£,*;/.,*  ;  i,  ,(.-,*  1 1; 

=  k,  -T-  ( i  -  .),)£,*:  i,  iii',*  ,  i; 


Define  o  as  a  =  inf  ',0,',.  then  a  is  uniformly  greater  than  0. 
!■  /  - 

Applsinu  I  A.  1 1 1  rcairsi\cl>  results  in 


^  i 1  -  «m'  1 1  -  (,iit;,i: 


t’se  Assumption  4  and  take  an  uncomliiion;il  expectation  oxer 
in  t A  1  '  i  Then. 


or.i-  ,  +  k:\ i  _ 
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inhere  K  ,  <  Kt  ^  1 1  -  ar  c  =  K,  11)  and  K  ■_  -  I. 1  I  >(c,*l!  e 
Therefore,  the  claim  of  Theorem  1  is  proved  (q.e.d  I 

2  Proof  of  Theorem  2 

Define  a  Liapunov  function  as  Llcfl  =  e* '/>* '  ’«•*.  Alter 
some  manipulations,  using  la, 2’  I  for  each  inters al  in  k.k  —  V  . 
it  can  be  shown  that  the  conditional  expectation  of  1 , .  .(<•*.  si 
-  tk|ef  >  given  f,*  =  ;«•!!. t'T _ ef !  satisfies 

I*-'  ) 

<  -»■:'£>:•  I  c/;'.//,yM 

ti-k  ) 

T  V  tr  -rlr  [L'/rt  'I'.Q, 

/’V  *  l 


+  V  £,*]  V  T,r  v],Fl  ,u ,  J,  l  (  A  14i 

J  ■■  k  -  1  '  /  -  k  -  1  ) 

where  H.  =  ,  1 H ..  T,  =  kfv,  -  L*,r>,- 1  and  «/,, 

i- 1  ‘ 

-  (T  /*.,  :/,.  Furthermore,  e*  =  ia  ,ef  -  Tl  wl(7|  and  the 
i=i  i,-i 

fact  that  the  v,s  and  r,.s  are  zero-mean,  independent  noise- 
sequences  are  used  in  the  derivation.  Note  that  in  the  last  two 
terms  of  i  A.  14 1.  there  is  correlation  between  o. ,  and  the  random 
noises  If  Schwarz's  inequality  is  applied  to  each  element  of  the 

vector  £,!>,;  where  u,  =  ^  v>,., T,.  it  can  he  shown 

i  =  v  -  i 

that  the  £,?>,  |  is  bounded.  Since  Assumption  5  is  satisfied 
(observability  condition  I.  p2  is  bounded  from  above  such  that  L* 
is  bounded.  Note  that  the  r,s  and  <v,s  are  zero-mean  independent 
white  noises  w  ith  finite  second  moments.  Therefore,  there  exists  a 

k  -  \ 

constant  vector  iJ,  such  that  V  =  fl,.  Similarly,  if 

,  T- 1 

Schwarz's  inequality  and  the  hypothesis  of  the  boundedness  of 
fourth  moments  of  the  noises  are  applied  to  the  last  lermol  iA.I4l. 
it  is  possible  to  find  a  constant  K„  which  bounds  the  last  term  of 
IA.I4i.  Note  also  that  since  Assumption  5  is  satisfied,  the 
argument  of  the  expectation  operator  of  the  first  term  of  the  RHS 
of  (A. 14)  is  strictly  uniformly  positive  definite.  The  terms 
involving  the  tr  operator  in  (A.  14)  can  be  written  as 

tr  [;•;  '//,/>;//;  ]  +  tr  [pr  -  2H,rv/  ' II, Q,  , 

+  H),;' H,pjr (.4.15 1 


Then,  from  Lemma  I  in  Section  .V  trip*  'Qi  ,  i  of  (A  I5i  in 
uniformly  hounded  from  above.  Therefore,  the  K  US  of  (A  Mliuss 
the  form  of -t'*'  1  o'*  +■  A’-  -  2c*  ‘  IT.  where  !;  ji  I  .-(bind 
A,<  A]  <  a.  furthermore,  there  exists  a  matrix  (  ->  i  I  -  o 
for  some  i  >  0  and  a  constant  0  <  K  -  /  .  lor  ail  kt  /  .  such 

that 

-ef'.V?  “  ct'Kk  <  -cf'Cc?  -  K  ( A. 1 6» 

If  we  denote  the  index  k  as  i  -  I.undA  +  \  as  t  *  1.  then  it  iseusy 
to  see  that  iA.I4i  satisfies  lA.lUi.  Ilieretore.  (A  I2i  can  al>o  he 
obtained  The  rest  of  the  proof  i>  the  same  .is  that  of  Theorem  I 

(q  e  d  I 

3  Pnmt  of  Theorem  } 

Introduce  a  primary  Lyapunov  function  tor  the  MCiLKL  as 

Ll cd-c/p;  v,  t  A  I  7 1 

where  p*  1  is  hounded  from  below  by  Assumptions  4  or  5  and  p* 
is  a  function  of  :2  and  v*  such  that  />,'  is  not  correlated  with  i 
Now  a  secondary  Lyapunov  function  lor  the  NKiFKI  is 
introduced  as 

l.le,  i  =  e,  'up  1 « ,  I A  I  x  i 

where  m2  is  a  function  of  :*  ,  and  v*  ,  such  that  m2  is 
independent  of  ,.  Take  the  eotuhtion.il  evpeeiation  over 
Lie, I  -  r.lc.l  for  given  i  =  .«•  ,  Then  Iron)  the 

hypothesis  13.35 1 

Ef,\  Lie,)  -  Lie, ll  =£,.-(!  -  /»,  i /o  ;  C,i<  I  l  A  Id  I 

where  kK  is  defined  as 

K, .  =  tr i ;;  -  .u, i '/•,*  ‘i/k*  -  .u  i. , .  i  \:oi 

and  K  ,  is  bounded  from  above  by  the  hypothesis  oil  he  theorem 
Note  i  hat  I  A.  19 1  is  similar  to  l  A. 4 1  of  Theorem  I  Sinul.n  ly.  if  the 
hypothesis  1 3.36 )  is  satisfied,  ihe  conditional  evpeeiation  over 
Lie,)  -  I,  1 1 1‘,  - 1 )  for  given  >,  I  =  ; f,i. I . . , .  e,  ,  j  satisfies  an 
equation  similar  to  l  A. Si.  The  remainder  of  the  proof  is  the  same 
as  that  given  in  Theorem  I 

Iq  ed  i 
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ABSTRACT 


A  maximum  likelihood  estimation  method  is  developed  for  a  class  of 
problems  where  the  dynamics  are  linear  and  the  measurement  function  is 
nonlinear.  In  this  method,  called  the  assumed  density  filten  ADF),  the  form 
of  the  condiuonal  probability  density  function(CPDF)  is  selected  to  be  a 
funcuon  of  a  finite  number  of  quanuties.  These  quantities  which  describe 
the  approximate  shape  of  the  CPDF  around  the  mode  are  propagated 
through  each  measurement  interval.  At  the  measurement  the  CPDF  is 
updated  using  Bayes  theorem  and  its  mode,  computed  numerically,  is 
defined  to  be  the  best  estimate  of  the  state.  The  postenon  CPDF  is  then 
approximated  by  a  Taylor  senes  expansion  about  its  mode  to  preserve  the 
assumed  functional  form.  The  numencal  results  for  a  target-intercept 
problem  indicate  that  the  ADF  is  superior  to  the  extended  Kalman  filter. 
Howe- er  the  ADF  has  a  negative  range  bias.  It  is  analytically  proved, with 
some  approximations,  that  the  maximum  likelihood  range  estimates  are 
smaller  than  the  mean  range  estimates. 


/  INTRODUCTION 

Tactical  weapon  systems  require  accurate  tracking  of 
maneuverable  vehicles  such  as  submarines  and  airplanes.  Dunng  the 
last  several  years,  there  has  been  an  active  interest  in  the 
development  of  sophisticated  filtering  algorithms  for  tracking  with 
beanngs-only  as  the  observations.  Mathematically  .this  problem  can 
be  described  in  an  inertial  rectangular  coordinate  frame  by  a  linear 
dynamical  model  and  a  nonlinear  discrete  observation  model  or  in 
an  inertial  polar  coordinate  frame  by  a  nonlinear  dynamical  model 
and  a  linear  discrete  observation  model.  Satisfactory  results  for  this 
class  of  problems  have  been  difficult  to  obtain  using  current 
mechanizable  filters  because  of  the  nonlinearity  and  the  passive 
nature  of  the  observations.  As  a  result,  considerable  research  has 
been  going  on  to  improve  the  existing  methods  in  order  to  obtain 
better  estimates. 

2  APPROXIMATIONS  IN  NONUN  EAR  FILTERING  THEORY 

The  target  tracking  problem  is  stochastic  in  nature. 
Analyses  of  stochastic  problems  are  possible  through  statistical 
interpretations.  In  order  to  obtain  mathematical  expressions  for  the 
statistics,  assumed  to  represent  the  best  estimates  of  the  states 
associated  with  a  problem,  knowledge  of  the  underlying  probability 
density  function  (PDF)  is  essential. 

If  the  system  dynamics  and/or  the  measurement 
function  are  nonlinear.a  finite  set  of  statisucs  sufficient  to  describe 
the  conditional  probability  density  funcuon  (CPDF)  is  not  available 
(1,2,3).  Even  if  the  initial  states  and  the  process  noise  are  assumed 
Gaussian.th*  nonlinear  dynamical  system  results  in  a  non-Gaussian 
CPDF  Second. the  propagation  equation  for  the  conditional  mean 
consists  of  expectations  of  nonlinear  functions  that  arc  very  difficult 
to  evaluate.  Third.since  the  CPDF  is  not  Gaussian,  the  system  ot 
equations  to  describe  the  conditional  moments  and. hence. the  filter 
as  a  whole  becomes  coupled  and  inlimte-dimensional. 
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To  circumvent  these  difficulties.approximations  have 
been  attempted  to  realize  estimation  methods  consisting  of  a  finite 
number  of  equations.  The  Edgeworth  series  expansion  has  been 
used  by  Sorenson  and  Stuberrud(4)  in  developing  a  finite  - 
dimensional  filter  for  a  discrete  scalar  system.  The  Edgeworth 
series  (5)  is  described  by  an  asymptotic  expansion  about  a  kernel 
and  it  consists  of  Hermite  polynomials  (5)  and  their  coefficients  are 
given  by  quasi-moments  which  are  related  to  the  central  moments. 
Sorenson  and  Stubberud  have  chosen  a  Gaussian  kernel  for  their 
applications  and  approximated  the  CPDF  by  an  Edgeworth  senes 
truncated  after  the  fourth  term.  Thus.the  first  four  quasi-moments 
define  the  CPDF.  The  problem  of  expectations  of  nonlinear 
functions  in  the  propagation  equations  has  been  handled  by  senes 
expansions.  The  state  and  measurement  equations  were 
approximated  with  second-order  perturbauons  using  a  Taylor  series. 
Due  to  the  nonlinear  term  in  the  measurements, the  update  equations 
across  the  measurements  become  very  involved  requinng  further 
approximations.  Note  that  the  filter  developed  in  this  paperdoes  not 
have  such  approximations  for  the  nonlinear  measurements. 
Furthermore.Sorenson  and  Stubberud  have  reported  that  the 
approximations  associated  with  the  nonlinear  measurment  term  are 
of  "critical  importance"  to  stabilise  the  behavior  of  the  second  and 
the  fourth  moments  and  thereby,  to  the  performance  of  their  filter. 

A  similar  method  of  parametrizing  the  CPDF  has  been 
reported  by  Willsky  (6).  He  has  used  the  study  of  random  processes 
on  the  circle  effectively  to  formulate, using  Fourier  series.a  variety 
of  nonlinear  estimation  problem^  arising  in  the  field  of 
communications.  He  has  discussed  a  few  finite-dimensional 
approximations  for  a  scalar  continuous  time  problem  (6).  The 
results  of  approximating  the  CPDF  with  the  first  three  coefficients 
of  Fourier  series  when  applied  to  a  phase-tracking  problem  were 
found  to  be  very  poor.  The  reason  according  to  Willsky  was  that 
the  truncated  terms  of  the  senes  might  not  have  been  negligible. 
This  was  shown  by  an  example  assuming  perfect  knowledge  of  the 
phase. 

An  alternate  method  to  completely  neglecting  the 
higher-order  moments  or  coefficients  is  discussed  by  Kushner  (3). 
Instead  of  truncabon  of  the  higher  order  moments,  he  has  devised  a 
method  to  replace  them  with  lower  order  moments.  The  method, 
called  the  ‘moment  sequences',  involves  picking  an  ‘n'  parameter 
moment  approximation  to  the  CPDF.  When  moments  of  order 
higher  than  *n’  are  encountered,  they  can  be  computed  in  terms  of 
the  first  *n'  moments  resulting  in  a  better  approximauon  than 
assuming  them  to  be  zero.  Kushner  has  discussed  the  conditions  to 
be  sauslied  for  picking  the  moment  sequence  for  a  scalar  problem 
It  is  not  clear  from  his  paper  as  to  how  the  moment  sequences  could 
be  picked  for  a  general  multidimensional  problem. 

Some  other  formulations  of  parametrizing  the  CPDF 
with  moments  have  been  reported  (1,7).  An  approximation  has 
been  made  by  assuming  the  CPDF  as  Gaussian  and  neglecting  the 
even  moments  of  order  higher  than  four.  The  system  and 
measurement  nonlincanties  in  this  method  are  carried  to  second- 
order.  The  resulting  filter  is  known  as  the  Gaussian  second-order 
tiller  (1,7).  A  slightly  different  version  of  this  filter  has  been 
derived  assuming  that  the  CPDF  is  almost  symmetric  and 
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concentrated  near  its  mean.  Such  a  basis  allows  for  ignoring  the 
fourth  central  moment.resulnng  in  what  is  termed  as  the  truncated 
second-order  filter  (1,7).  The  basic  difference  between  the  two 
filters  is  in  the  propagation  equation  for  the  second  moment.  The 
modified  second-order  filter  (1)  is  the  version  of  these  two  filters 
without  the  measurement  term,  which  contains  random  noise, 
included  in  the  covariance  equaton.  This  is  done  in  order  to  prevent 
the  covariance  from  taking  negative  values.  The  most  popular 
method  in  the  applications  of  nonlinear  filtering  theory  is  the 
extension  of  the  Kalman  filter  methodology  (1,8),  which  is  optimal 
for  linear  systems,  to  nonlinear  problems  through  linear  perturbation 
theory,  rhe  resulting  filter  is  called  the  extended  Kalman  filter  (1). 
However,  it  is  not  known  how  the  statistics  of  the  CPDF  relate  to 
the  extended  Kalman  filter  (EKF).  Simulation  results  for  a  scalar 
problem  using  the  second-order  filters  discussed  here  and  the  EKF 
have  been  reported  by  Schwartz  and  Stear  (7).  Their  results  showed 
no  particular  merits  of  any  second-order  or  any  distinct  superiority 
of  the  second-order  filters  over  the  EKF. 

Another  technique  to  approximate  the  CPDF  involves 
the  idea  of  cumulants  (6,9).  The  advantage  of  the  cumulants  over 
the  moments  is  that  while  the  higher  order  moments  may  not  tend  to 
zero  ,it  is  reasonable  to  assume  that  the  higher  order  cumulants  tend 
to  zero.  Nakazimo  (9)  has  assumed  a  Gram-Charlier  expansion  (5) 
for  the  CPDF  characterized  by  cumulants.  He  has  also  derived  the 
dynamical  equations  for  the  cumulants  for  a  nonlinear  continuous 
time  problem  that  are  infinite-dimensional  in  nature  and  discussed 
finite-dimensional  approximations  by  truncation.  Willsky  (6)  has 
referred  to  the  possible  approximation  of  the  CPDF  using  cumulants 
for  the  nonlinear  estimation  problems  in  communication  theory. 
Both  have  not  discussed  any  numerical  results. 

3  A  NFW  FILTER  FORMULATION  AND  ITS  RElATIO\  TO 
PREVIOUS  WORK 

All  the  approximate  filters,  described  in  Section  2, 
claim  to  estimate  the  conditional  mean.  A  major  difficulty  in  the 
estimat;  n  of  the  conditional  mean  is  the  computation  of  the 
normalizing  constant  of  the  CPDF  at  the  measurement  update  for 
the  nonlinear  problem  (1).  In  this  study, new  filter  structures  which 
are  more  complex  but  mechanizable  are  proposed  The  conditional 
mode  is  assumed  to  be  the  closest  representation  of  the  state,  thus, 
eliminating  the  normalizing  constant  from  the  computations. 
Also.there  is  no  approximation  to  the  nonlinear  measurement 
function  as  in  the  EKF.  The  basic  idea  is  to  choose  the  form  of  the 
CPDF  to  be  a  function  of  a  finite  number  of  quantities  and  to  project 
these  quantities  through  each  measurement  interval.  These 
quantities  describe  the  approximate  shape  of  the  CPDF  around  the 
mode.  The  vector  which  maximizes  the  approximate  posteriori 
CPDF  at  a  measurement  is  defined  to  be  the  best  estimate  of  the  true 
state  This  method  is  referred  to  in  this  paper  as  the  assumed  density 
filter  (ADF).  The  ADF  is  applicable  to  a  class  of  problems  where 
the  dynamics  are  linear  and  the  measurement  functions  are 
nonlinear.  The  equations  that  define  the  ADF  are  developed  and  the 
ADF  is  applied  to  a  homing  missile  problem.  The  results  are 
explained  and  the  inherent  biases  in  certain  formulations  are 
indicated  The  performance  of  the  ADF  is  also  compared  to  the 
widely-used  EKF. 

3  I  Assumed  Form  of  The  Density  Function 

In  this  section,  an  assumed  form  of  the  unnormalized 
CPDF  is  presented.  Then,  the  procedure  for  processing 
measurements  is  developed,  and  the  equations  for  propagating  the 
quantities  required  for  measurement  processing  are  discussed.  To 
avoid  having  to  develop  a  theory  for  three-dimensional  matrices,  a 
mixed  matnx-indicial  notation  is  used.  Hence,  throughout  thus 
section,  a  repeated  index  denotes  summation.  The  equations  are 
developed  in  rectangular  coordinates  where  the  propagation 
equation  for  the  approximate  conditional  mode  is  linear. 

There  are  two  considerations  in  the  selection  of  the 
form  of  the  PDF  First,  the  density  function  should  be  a  function  of 
a  finite  number  of  parameters  Second,  the  functional  form  of  the 


density  should  be  preservahle  during  the  processing  ut  a 
measurement  These  conditions  can  he  accomplished  by  puking  an 
exponential  form  for  the  density  function  and  by  writing  the 
argument  of  the  exponent  in  the  form  ol  a  Taylor  senes  aNiut  the 
mode  and  neglecting  terms  higher  than  a  predetermine.:  order 
Hence,  the  assumed  form  of  the  PDF  ol  an  n  stale  vector  x  is  given 
by 

p(x.t)  =  Ci  exp  |— /(x.r)|  (I) 

where  C  i  is  a  normalizing  constant  and  tlx.tl  is  a  non-negative 
valued  function  of  x.  The  function  ftx.t)  is  assumed  expandable  in  a 
Taylor  senes  about  the  mode  of  the  PDF,  m,  and  expressed  as 

/  (t.r)=/  (m  ,r  )+f,(m  ■< )( x-m  )T/U(m,r 

+-jp(x-m  )T  f  w{m  ,i  )(x,-m;)(x  -m  )  (2) 

+-jp(x  -m  )T/uu,..( m  ■'  )(■ xrm,  ~m  )  + 

Here,  for  example,  the  term  f ^  denotes  the  partial  derivative  of 
the  matnx  /„  with  respect  to  the  component  state  x,  Also,  note 
that  f  and  its  denvatives  are  evaluated  at  the  mode  and  are  functions 
of  the  time,  as  is  the  mode. 

The  development  of  the  ADF  requires  the  equations  for 
processing  a  measurement  and  the  equations  for  propagating  the 
terms  /  ./,,/tt,  etc.,  to  the  next  measurement  time  However,  at 
the  mode,  p,  =  0  .so  that  /,(m,r)= 0  everywhere  In  addition,  it 
turns  out  that  the  term  f(m,t)  does  not  have  any  effect  on  the 
processing  of  the  measurements:  hence,  it  dees  not  have  to  be 
propagated  between  measurements. 

The  remaining  items  needed  for  the  development  of  the 
filter  are  the  system  dynamics  and  the  measurement-state  relation 
The  system  is  assumed  to  be  linear  Hence,  the  dynamics  are 
governed  by  the  equation 

x(r )  =  Fx(t )  +  b(t )  +  w(r )  (?) 

where  x  is  the  n-state  vector,  F  is  n*n  matnx  of  constants,  b  is  a 
nme-varying  n-vector  control,  and  w  is  a  Gaussian  zero-mean 
white-noise  process  w-ith  a  constant  power  spectral  density  Q  and  t 
denotes  the  time.  The  relationship  between  the  measurement  and  the 
state  is  represented  by  the  vector  equation 

z,  =fi(x,)  +  v,  (•?) 

Here.z,  is  the  p-vector  measurement,  h  is  the  p-vector  known 
nonlinear  functions  of  the  state  X/,  v,  is  a  p-vector  Gaussian  zero- 
mean  sequence  of  random  variables  with  variance  V,  and  the 
subscript  t  denotes  the  time  at  which  the  measurement  is  made 

3.2  Update  Equations 

The  approximate  CPDF  pnor  to  the  measurement  z,  is 

given  by 

p(x,  /Z , )=C  2i  exp  (-77  (x  -m  )Tfa(x-m) 

-  ~(x-m)T f^x-m^x-m) 

-  -p-(x-m  )r/m,x.(Xy-m;)(xj-m*)(x-m  )“•■)(  (5) 

4!  _  _ 

where  C2  =  C|  exp(-f)7n  is  the  apnon  mode  the  bar  denotes 
quantities  evaluated  at  m  ,  and  Z/_i  denotes  the  measurement  history 
upto  1-1. 

In  processing  the  measurement  z,.  the  approximate 
conditional  density  function  is  updated  using  Bayes  theorem  which 
le  ads  to 

p(xilZt)  =  C  3/exp[-/  (r  ,z )/)]  (6) 

where  Cx  is  the  posteriori  normalizing  constant  which  is  not  a 
function  of  x  but  only  of  Z,  and  where 
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At  this  point.  Eq  (6)  is  maximized  with  respect  to  t(  to 
obtain  the  posteriori  mode  m,  which  is  the  maximum  likelihood 
estimate  ot  the  stjte.  For  some  problems,  the  maximization  can  be 
earned  out  analytically.  however.if  this  is  not  possible,  a  numerical 
method  such  as  the  Newton-Raphson(  10) 

method  must  be  employed.  Note  that  the  mode  does  not  depend  on 
C  v  f  inally,  the  posteriori  conditional  density  function  is  expanded 
in  a  Taylor  senes  about  the  postenon  mode  to  obtain 

P 1 *  4>  =  l'4l«p|— vptx-m  »r/u(l-m  ) 


-  |,  (8) 

where  C4/  =  C  \,ex p(-f  )  and  the  carat  denotes  a  quantity  evaluated 
at  rn  Note  that  the  derivatives  of  f  consist  of  the  measurement 

The  functional  form  of  the  density  function  after  the 
measurement.  Eq.(S)  ,  after  the  approximations  is  the  same  as  that 
before  the  measurement.  Eq.(5).  ffence,  the  functional  form  has 
been  preserved. 

3  3  Propagation  Equations 

The  differential  equations  for  propagating  the  mode  and 
the  necessary  derivatives  of  f  are  derived  by  repeated  differentiation 
of  Eq.(  1 ),  Kolmogorov's  equation  ( i 

Pi--P  tr  (F)-p/(Fx+b)  +  ^QltpUMi,  (9) 

and  by  use  of  the  fact  that  px  =  0  at  the  mode.  The  argument  t  of  x 
.pp,  and  pa  have  been  dropped  for  convenience.  For  a  Gaussian 
probability  distribution,  the  derivative  /„  is  the  inverse  of  the 
covariance  matrix.  Hence,  to  be  able  to  compare  the  results  of  the 
ADF  with  those  of  the  EKF,  the  equations  for  propagating  / a  is 
replaced  by  an  equation  for  propagating  fa~l=P.  Only,  the 
equations  through  /m  are  presented  because  this  is  the  highesr- 
order  denvative  present  in  the  differential  equation  for  the  mode. 

In  view  of  the  above  discussion,  the  equation  for 
propagating  the  mode  is  given  by 


m  =  Fm  +b  —  —P  (Q/kfItIiXjr 

Then,  the  equations  for  propagating  P  can  be  written  as 

P  -  fxxx.  Pr,Q,Ju,.x.  +FP+PPT 


f  x,x*xx)\P  ,  PQjdut, 


Note  that  P  must  be  inverted  in  order  to  obtain  the  fa  terms 
needed  to  perform  this  integration.  Finally,  the  differential  equation 
for  ! ax  is  the  following: 

f xxx, "f'f  iu F If  / IXX.Q ikj +  ^1,1 


+/  L-lJ  X,  I,  +/  X  .if  X,X  f  XXX.  I,  )P^Q,J  taJt  X.  • 


r  =  1.2...../I 

where  the  fifth-order  terms  have  been  discarded. 

The  boundary  conditions  for  the  propagation  equations 
are  the  values  of  m  JP  and  f  ^  obtained  alter  processing  the  last 
measurement.  These  propagation  equations  are  then  integrated  upto 
the  _  next  _  measurement  time  to  obtain  the  values  of 
m  ,  P  ,  and  f  xxx.  needed  for  the  next  measurement  update. 

3  4  A  Second-Order  Assumed  Density  Filter 

A  second-order  ADF  is  developed  in  order  to  to 
illustrate  the  adaptive  nature  of  the  ADF.  Note  that  in  the 
terminology  ot  the  ADF.  the  EKF  is  a  second-order  filter  because  of 
the  linearized  measurement-state  relation  and  the  assumption  ot  a 
Gaussian  probability  distribution. 

If  the  Taylor  senes  expansion  is  truncated  after  the 
second-order  terms,  the  exponent  of  the  postenon  density  function, 
Eq.(7),  becomes 

f(*)  =  ^(.:-h)TV~'U-h)+  ~(x-m)rP'\x-rn )  (13) 

where  the  substitution  /u  s/,‘l  has  been  made.  Then,  the 
postenon  mode  occurs  when/f  =  0  or  when 

+  (ar-m)r/ir_,l,_  =0  (14) 

Finallv,  if  Eq.(  13)  is  expanded  in  a  Taylor  senes  about  m  and  terms 
higher  than  second-order  are  dropped  (see  the  discussion  before 
Eq  (8))  ,  the  following  result  is  obtained. 

P-\h,TV-'h,-Ul-ht)V-llkhtXM+p-'\-\mM  05) 

Here,  the  subsenpts  on  V'1  refer  to  the  elements  of  V,_l  Also.  P  , 
which  is  the  inverse  of  the  curvature  at  the  likelihood  point,  must  be 
positive  definite. 

With  regard  to  the  propagation  equations,  Eqs.(10)  and 
(11) reduce  to 

m  =  Fm  +  b  (16) 

P=FP+PFt  +  Q  (17) 

The  imtial  conditions  are  obtained  from  Eqs.(14)  and  (15). 

Eq .(15)  points  out  the  adaptive  nature  of  the  ADF.  The 
second-order  term  h ^  allows  the  ADF  to  adapt  to  the  measurement 
residuals.  This  feature  can  be  very  useful  when  the  measurement 
uncertainty  is  inaccurately  modeled  or  when  the  states  ate 
inaccurately  initialised. 

4.  HOMING  MISSILE-INTERCEPT  PROBLEM 

A  specific  application  of  interest  for  the  filtering 
technique  liscussed  in  Section  3  is  in  the  estimation  of  the  states  of 
a  homing  missile  relative  to  a  target  and  the  target  acceleration.  A 
six-degree-of-freedom  computer  program  (11),  which  simulates  the 
interception  of  a  maneuvering  target  by  a  bank-to-tum,  short-range, 
air-to-air  homing  missile  has  been  used  to  test  the  ADF  and  the 
EKF  The  guidance  scheme  to  compute  the  commanded  missile 
acceleration  has  been  based  on  an  'optimal'  linear  guidance 
law(  12). 

The  launch  geometry  used  in  this  analysis  is  described 
in  Figure  I  For  this  inertial  system,  the  Z/  axis  is  directed  towards 
the  earth's  center,  the  Xt  axis  is  aligned  parallel  with  the  missile's 
initial  launch  direction,  and  the  Y )  axis  is  chosen  to  make  the 
inertial  system  nght-handed.  The  engagement  geometry  used  in  this 
analysis  is  characterized  by  the  initial  conditions:  range.  3000  feet: 
altitude,  10,000  feet;  aspect  angle  (0„),  120  degrees,  and  off- 
boresight  angle  (0fc ),  0.0  degree.  The  measurement  and  the  state 
noise  models  and  their  statistics  are  the  same  as  in  (13).  The  initial 
state  covariance  are  the  same  as  in  ( 14)  The  number  of  Monte  Carlo 
trials  is  ten.  A  second-order  ADF  is  used  in  all  the  numerical 


experiments. 
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In  the  first  attempt  to  validate  the  ADF  using  the 
selected  engagement  geometry,  the  ADF  fails  to  converge  near  the 
end  of  the  trajectory  because  P,  defined  as  the  approximate 
covariance  matrix,  becomes  indefinite.  In  trying  to  determine  why 
the  ADF  behaves  this  way,  it  is  found  that  towards  the  end.  the 
residuals  become  large.  This  phenomenon  occurs  because  the  true 
range  vector  and  the  estimated  range  vector  have  their  components 
in  different  quadrants.  Although  the  error  in  the  range  is  not  very 
large,  the  difference  between  the  meaurement  which  is  made  on  the 
true  trajectory  and  the  filter-computed  measurement  is  of  the  order 
of  180  degrees.  Consequently,  the  optimization  process  does  not 
converge.  The  optimization  process  is  terminated  on  the  condition 
that  the  changes  to  the  azimuth  and  eievanon  angles  become  smaller 
than  10-6  .  At  this  point,  the  curvature  of  the  likelihood  function 
need  not  be  positive-  definite  and  as  a  result  P  defined  as  the  inverse 
of  the  curvature  is  also  non-positive  definite.  Note  that  if  the 
residual  dependent  term  is  not  included  in  the  updating  of  P,  it 
always  remains  posinve-definite  as  in  the  case  of  the  EKF.  In  fact, 
in  the  formulations  of  the  Gaussian  and  truncated  second-order 
filters  (1),  this  random  term  is  dropped  in  order  to  avoid  the 
covariance  from  becoming  non-positive  definite.  In  this  analysis, 
however,  the  random  term  is  retained  so  that  it  could  add  more 
information  to  the  the  covariance.  To  circumvent  the  convergence 
problem,  the  measurements  that  cause  large  non-converging 
residuals  are  discarded.  The  underlying  reasoning  is  that  if  a 
measurement  causes  P  to  become  indefinite,  then  it  increases  the 
uncertainty  of  the  estimates  which  P  represents.  Consequently,  it 
does  not  help  the  filtering  process  and  .therefore,  is  ignored.  The 
propagated  states  and  the  covariance  are  used  to  continue  to  the  next 
measurement  time. 

The  error  histories  of  the  ADF  and  the  EKF  for  various 
conditions  are  given  in  Figuris  2  through  8.  The  error  is  defined  as 
the  difference  between  the  magnitudes  of  the  true  and  the  estimated 
range  vectors.  The  magnitude  of  the  estimated  range  is  obtained  by 
averaging  over  the  ten  Monte  Carlo  runs. 

The  range  errors  for  the  nominal  case  are  given  in 
Figure  2.  The  ADF  tracks  better  than  the  EKF  during  most  of  the 
flight.  However,  it  exhibits  a  negative  range  bias.  The  range  error 
history  when  the  initial  range  has  perturbanons  of  500  feet  in  the 
positive  and  negative  directions  are  given  in  Figures  3  and  4 
respecuvely.  In  both  cases,  the  performance  of  the  ADF  is  superior 
to  that  of  the  EKF. 

Since  the  filter  does  not  know  the  actual  measurement 
noise  variance,  a  measurement  mismatch,  defined  as  the  ratio  of  the 
actual  value  to  the  assumed  measurement  covariance  in  the  filters  is 
hypothesized  in  generating  the  measurements.  The  range  errors  of 
the  simulations  with  a  measurement  mismatch  of  0.1  is  presented  in 
Figure  5.  The  performance  trends  of  both  filters  are  similar  to  the 
nominal  case  in  Figure  2  except  that  the  ADF  performs  better 
during  the  later  part  of  the  flight.  For  the  higher  mismatch  of  ten.  the 
performance  of  the  ADF,  as  can  be  observed  from  Figure  6,  is  much 
worse  than  that  of  the  EKF. 


posteriori  state  error  covanance.becomes  indefinite  Investigation 
of  the  history  of  the  range  errors  shows  that  the  failure  of 
convergence  is  always  preceded  by  the  filler  computed  range 
approaching  zero  when  the  actual  range  is  far  from  zero  'Dus 
phenomenon  leads  to  the  conjecture  that  there  might  be  a  negative 
bias  in  the  range  computed  by  the  ADF. 

This  section  deals  with  the  analysis  of  the  bias  in  the 
maximum  likelihood  estimates  of  the  states  of  a  system  Two 
inequalities  relating  the  ranges  from  different  methods  are 
established.  They  are 

1) Rm<R  (18) 

where  Rm  is  the  unconditional  maximum  likelihood  range  estimate 
of  two  Gaussian  variables  x  and  y  and  R  is  the  unconditional  range 
obtained  from  the  same  Gaussian  distribution. 

2) R\<R,<Rm.  (19) 

where  Rn  is  the  conditional  maximum  likelihood  range  of  two 
random  variables  x,  and  y,  with  a  CPDF  p{x,,y,/Z ,)  ,  R,  is  the 
conditional  mean  of  the  same  CPDF  and  RMt  is  the  conditional 
maximum  likelihood  range  of  p(Rl,Q,IZ,)  ,  the  CPDF  of  {R,0  ) 
obtained  from  the  CPDF  of  (x.y). 


5.1  Relationship  Between  The  Mean  Range  And  The  Maximum 
Likelihood  Range  For  An  Unconditional  Density  Function 

The  relative  magnitudes  of  the  mean  range  and  the 
maximum  likelihood  range  of  the  two  Gaussian  random  variables  x 
and  y  are  compared  in  this  section.  Expressions  for  the  mean  range 
and  the  maximum  likelihood  range  are  obtained  and  Minkowski’s 
inequality  (18)  is  used  to  prove  that  the  mean  range  is  always  equal 
to  or  greater  than  the  maximum  likelihood  range. 

Assume  that  the  joint  PDF  of  two  random  variables  x 
and  y  is  given  by 


p(x,y)  =  C  exp\-j(x-Z,y-y)P~'  XJL 
z  y—\ 


(20) 


where 

x  ,£_=  the  means  of  x  and  y  respectively 

P  =  and  the  state  error  covariance  matnx  of  x  and  y 
C  =  a  normalising  constant. 

The  maximizauon  of  the  joint  PDF  with  respect  to  x  and  y  leads  to 
the  minimization  of  the  negative  pan  of  the  argument  of  the 
exponent,  called  the  likelihood  function,  L. 


L  s (x-x.y-y)P'1 


x—x 

y-y 


(21) 


For  comparison  and  later  use.  the  likelihood  function  L  is  defined 
in  terms  of  the  range,  R,  and  the  azimuth  angle.0  ,  as 


L  =(R  cosO-x  Jt  sin0-y )  P  1 


R cosO-x 
/?sin0-y 


(22) 


To  simulate  actual  situations  where  uncertainties  in 
both  the  states  and  the  noise  statistics  can  occur,  experiments  are 
made  with  perturbations  to  the  initial  states  and  measurement 
mismatches.  The  range  error  history  with  an  initial  state  error  of  500 
ft  and  a  measurment  mismatch  of  0.1  is  presented  in  Figure  7.  With 
the  same  inual  error,  and  a  measurement  mismatch  of  ten,  the 
results  in  the  range  errors  are  given  in  Figure  8.  The  ADF  tracks 
better  than  the  EKF  in  both  cases.  From  these  numerical 
experiments,  it  can  be  seen  that  the  ADF  shows  a  high  sensitivity  to 
a  higher  measurement  mismatch.  This  can,  however,  be  alleviated 
by  adaptively  estimating  the  measurement  noise  covariance  (15,16). 
The  EKF  performs  well  when  the  noise  statistics  are  uncertain. 
However,  its  response  to  imperfect  initial  conditions  is  poor. 

5.  DIASES  IN  THE  MAXIMUM  LIKELIHOOD  FILTERS 

As  discussed  in  Section  4,  the  ADF  fails  to  converge 
near  the  end  of  the  trajectory  because  P, defined  as  the  approximate 


where  R  =  (x2+y  2)l/2  and  0  =  tan~'(y  lx ). 

The  values  for  the  range,  Rm, and  the  azimuth  angle,0„  .that 
minimize  L  are  obtained  by  setting  the  partial  derivatives  of  L  with 
respect  to  R  and  0  to  zero,to  yield 

xm  =Rmcos6m  =x  and 

>'„  =/?msin0m  =y  (23) 

where  xm  andy„  are  the  values  of  x  and  y  which  minimize 
L.Also, 

=(r„2+y„!)l'!  (24) 

.  The  bias  in  the  maximum  likelihood  estimate  of  the 
range  Rm  is  shown  by  companson  with  that  of  the  mean  value  of 
die  range  R  which  is  given  by 

R  =  £  \R  I  =  J  j  (X 2+y  2)h2p ( x  ,y )  dx  dy  (25) 
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where  E(.)  is  the  unconditional  expected  value  operator. 
Minkowski's  inequality  (18)  can  be  used  to  state  the  following 
inequality. For  x  juy  , 


(}  \(x1^y1)v'2p(x,y)dx  <fy)2>(  \  j  x  p  (x  ,y )  dx  dy  )2 


+  (  |  j  v  P  (*  ,y )  dx  dy  )2  .  (26) 

The  terms  on  the  right  hand  side  of  Eq.(26)  are  the  mean  values  of 
x  and  y.  That  is, 

jx  p(x,y)dx  dy  *Elx  }  =  x  (27) 

and 

/  V  p  (•*  ,y  )  dx  dy  B  £  [v  )  =  y.  (28) 

With  the  aid  of  Eqs.(26),(27),and  (28),the  inequality  statement  can 
be  rewntten  as 


(£[R])2>J2  +  y:  =  /?m2. 


(29) 


Since  the  range  is  always  positive.Eq.(29)  implies  that 
R=E[R\>Rm>  0.  (30) 


5  2  Approximations  To  The  Posteriori  Conditional  Density 
Function 

With  the  assumption  that  the  measurement  noise 
variance  is  very  small  (a  reasonable  practical  assumption)  the 
conclusion  of  Eq.  (30)  is  shown  to  be  valid  for  the  posteriori  PDF 
conditioned  on  the  measurement.  In  order  to  compute  the 
conditional  mean  range,  an  expression  for  the  posteriori  CPDF  is 
required.  Assuming  that  the  apriori  CPDF  of  x,  andy,  and  the 
measurement  noise  distribution  at  time  i  are  known,  the  posteriori 
CPDF  can  be  computed  using  Bayes  rule.  In  doing  so,  however,  the 
evaluation  of  p(z;/Z,_\),  an  integral  which  is  a  normalising  constant 
is  difficult.  Consequently,  Laplace’s  technique  (17)  is  used  to 
approximate  p(z,/Z,_|)  with  the  assumption  that  the  measurement 
variance  V  is  very  small.  To  facilitate  easier  computation  of  the 
conditional  mean  range,  the  CPDF  is  obtained  in  polar  coordinates 
R  and  0. 

Assume  that  the  apriori  joint  PDF  of  the  state  variables 
x  and  y  at  stage  i,  conditioned  on  the  measurement  history  Z,_i 
upto  stage  (i-l),is  Gaussian  with  means  x  and  y  and  is  given  by 


p(xMy,/Z,_,) 


where 


1 


rxpl- 


~(^,~x,,y,-y,)P1  ' 


x—x, 

y-y, 


I  (31) 


P,  *5,  = 


•r ii  s  12 
*21  s72 


(32) 


The  PDF  of  v,  ,  the  measurement  noise  in  Eq.(4),  is  expressed  as 
1 


P(v,)  = 


exp(-^-v?V  '). 


(33) 


{2k  ijVll  )'n 

In  order  to  make  the  computation  of  the  mean  range 
simpler  ,  the  CPDF  of  the  rectangular  coordinates  x,  and  y,  is  first 
transformed  to  the  polar  coordinates  R ,  and  0,  The  result  is 


R,  cos9,-.r, 
/?,sin0,-v, 


In  processing  the  measurement  z,  ,the  CPDF  is  updated 
using  Bayes'  ruteas 


p{R,,0,2,) 


p  (z,/R,  ,0,  JZ, .  i )  p  (R,  ,0, /Z,  _ ! ) 

p(r.'Z,_,) 


(35) 


By  examining  on  the  right  hand  side  term  by  term.it  can  be  noted 
thatp(/?,,9,/Z,_,)  already  known.  From  the  joint  PDF  of  z,  and  v,, 
the  marginal  CPDF p(z,lZ,_\)  can  be  reduced  as  (10) 

P(2./Z,-1)=  \  p((Zi-v,yZ,-\)  p  ( v,  )dv, 

Cu 


_  f  [ — : — 

(2jc(1  l/|i  )1,2  L  ai(t>) 


(36) 


*>,($) 


,1b)  it  Is, 

^es^)((T),J  +  A')l^|-Tv'r|A' 


a,J^(0)  2ti,(4>) ' 

°  2 

wher t0  =  i,-v,  and  A,  =  J  exp  \  ]  dx  ... 


Also, 

a,  (0)  =  s  |  |Cos20  +  2s  I2sin0cos0  +  s  22sin‘0  and 


6,(0)  =  cos0(r  nx  +  s  )2y )  +  sin0(a  ,2x  +si2y)  (37) 

The  integral  on  the  right  hand  side  of  Eq.(36)  is 
approximated  by  using  Laplace's  method  (17).  The  idea  is  to  use 
the  assumption  that  the  measurement  variance  is  small  and, 
therefore,  V'"1  is  large.  The  main  contnbution  to  the  integral,  then, 
comes  from  the  region  where  the  dominating  term,  exp(-^v,2V'~l) 

is  maximum.  The  term  multiplying  the  dominant  term  is  expanded 
in  a  Taylor  series  about  the  maximum  point  and  the  integral  is  then 
evaluated  term  by  term.  To  first  order,  the  approximated  value  of 
p  (z,/Z,  _j)  [Appendix  A.  1]  is  given  by 


p(21/Z1.i)  =  CIl/fu(0,)  (38) 

where 


K  iCs  1 


a,(B) 


6>'2(0)  b. 2 

+  -^e*PM-^l((-^<'24A,)]. 


a,J,2(0) 


2a,  (0)  2  ' 


(39) 


By  the  substitutions  of  Eqs.(34),(36),  and  (39)  into  Eq.(35),  it  can  be 
shown  that 


R,  exp[-l/2(a,7~S,a,-Hr,-0,)2V'  ')) 
2jt|fP;||  (2jtl|V/||)1/2C;1Af  1,(0,) 


(40) 


5.3  Determination  Of  The  Conditional  Mean  Ranee 


With  the  expression  for  the  approximated  CPDF  in 
Eq.(40),  R,,  the  conditional  mean  range  at  i,  is  computed  in  this 
section.  Approximations  have  to  be  made  to  the  integrals  in  the 
expression  for  /?,  assuming  that  the  measurement  variance  is  very 
small.  The  mean  value  of  the  posteriori  range  is  given  by 


R,  =  j  R,p{R,'Z,)Ri 

o 


p(R,,9,/Z,.,)  = 


2k  M 


R.  expl--(a,TS,a,)] 


(34) 


,  p-(vp„o„z— ,)p(P„o,/z,-1)^  ^  {4I) 

-io  p{t,!Z,.{) 


where 


By  the  substitution  of  Eq;(40)  in  Eq.(41)  it  can  be  shown  (after 
some  manipulations)  that  R,  reduces  to  '( 19) 


can  be  concluded  that  R^  ,the  conditional  maximum  likelihood 
estimate  given  by  Eq.(50),is  always  smaller  than'/?,  .the  condmonl 
mean  range  .subject  to  the  assumptions  made. 


R> 


~  —  +  [—  [exp(-b,2lla,)  +  (-^)|  :]  +  —  exp  (6,2/2u,)l/ 
a,  a,  2  a, 

(exp  (-/>, 2/2 a, )  +•  b,/a, 1,2  ((-^-)1'2  +  A, )) 


(42) 


where  the  terms  containing  a,  and  b,  are  evaluated  at  0,  =  z,  For 
future  companson.it  is  observed  that  when  b,2ila,  is  large 
exp(-/>,  2/2a, )  =  0  and  Eq.(42)  reduces  to 

\b,  1 

R,  =  —  + - - - 

a‘  mi  +  (-)''2a,) 

It 

By  an  asymptotic  expansion  for  A,  (17)  and  neglecting  terms 
b,2 

containing  exp  ( - ),  A,  can  be  approximated  as 

2a, 

A,  =  V2rc .  (44) 


(43) 
a,  = 


By  substitution  of  A,  from  Eq.(44)  into  Eq.(43)  ,  the  expression  for 
/?,  is  reduced  to 


R, 


(45) 


Note  that  both  terms  on  the  right  hand  side  of  Eq.(45) 
are  positive.  The  first  term  will  be  shown  to  be  common  to 
and  /?*/,.  However,  companson  of  the  second  term  of  /?,  with 
that  of  Rn  and  RUt  will  establish  the  biases  of  /?„.  and  /?Mi. 


5  4  Determination  Of  The  Ranpe  That  Minimizes  The  Likelihood 
Function. 

An  expression  for  the  maximum  likelihood  range  of 
fs(x,,y,/Z,_i)  is  obtained  in  this  section  for  companson  with  /?,,  given 
in  Eq.(45).  Maximization  of  the  CPDF  for  x.andv,  amounts  to 
minimization  of  the  likelihood  function  which  is  defined  as  (19) 

L  =  i[(z,  -e,)2^-1  +/?,2a,  ~2R,b,  +c,]  .  (46) 


where 


C,  =  4  1  lX,2  +  2s  ,2xryi  +  J  2iy  2 

The  values  of  Rn  and  which  minimize  L(in  the  process  of 
maximizing  the  PDF)  are  obtained  as  (19) 


9.-8. 


(47) 


and 

flv  =  2,-V(R,W-2R,b,')  L*.  (48) 

da,  da, 

where  a,'  =  - —  and  b,'s—~  Consistent  with  the  argument  in 
oO,  00; 

approximating  /?,,  the  measurement  noise  variance  V  is  assumed  to 
be  very  small.  Consequently,  the  term  containing  V  in  Eq.(48)  is 
neglected.  The  result  is 


=  z, 

Eq  (49)  leads  to 


* 


(49) 


(50) 


By  comparing  the  expression  for  R^  with  that  of  R,%  Eq  (45),  it 


5.5  A  Method  Of  Modifying  The  Likelihood  Estimate 

Since  the  maximum  likelihood  estimate  is 
biased.techniques  for  the  reduction  of  the  bias  are  developed  One 
method  is  determined  by  considering  the  apnon  PDF  transtormed 
from  x  and  y  to  R  and  0  which  can  be  written  as 


(51) 


After  processing  the  measurement,  z,  .the  postenori  CPDF  can  be 
computed  as 


P(R,,0,IZ,)  = 


— ' \= - W/?,  «w(— tI(z,-Q,)2V'‘i 

POJZ,- ,)  2it'iP,il  2 


+  R,2a,-2R,  b,  +c,  ])  . 


(52) 


The  transformed  CPDF  given  by  Eq.(52)  is  used  to 
obtain  the  maximum  likelihood  estimates  of  /?,  and  0,  instead  of 
defining  Rn  =  (Jtw2+yR2)112  where  x^  and  y\  are  the  maximum 
likelihood  estimates  of  the  postenori  CPDF  in  x  and  y  .  It  can  be 
proved  now  that,/?M  .the  range  thus  obtained  is  always  greater  than 


The  maximization  equations  for p(R,Q,/Z,) 

lead  to  (19) 

b,  +  (6, 2  +  4a,)1'2 

Km  = 

(53) 

2a, 

6,  =6* 

and 

0M,  = 

z,  -  V[R2a,'-2R,b,‘ 

(54) 

6.-0* 

The  assumption  that  the  measurement  noise  covariance.  V,  is  small, 
is  used  in  Eq.(54)  to  obtain  an  approximation  to  0*f,  as 


0.0,  r  r,  •  (55) 


Substitution  of  Eq.(55)  into  Eq.(53)  is  made  to  yield 
b,  +  (b2  +  4  a,)'* 


R\t.  -  ' 


2a, 


(56) 


The  right  hand  side  of  Eq.(56)  is  now  rewritten  and  expanded  to 
show  that  RMt  is  always  greater  than  Rn. 


«m,  =  T-  +  T_(]+4TT) 
2a,  2a,  b,2 


TV(,  +  IT+V 


(57) 


where  0(— )  contains  terms  of  the  series  of  order  equal  to  or 
'  a,2 

higher  than  (~^j)  ■  This  binomial  expansion  is  valid  tor 
4a, 

—j  <  1  and  this  condition  is  usually  satisfied  in  actual  cases 

al 

(Appendix  A. 2).  Neglecting  terms  of  order  higher  than  ( — - )  the 
range  RM,  can  be  approximated  as 


Km.  = 


9,=9i i=r. 


(58) 


This  expression  on  the  nght  hand  side  of  Eq.f 58)  is  always  greater 
than  /? n  in  Eq  (50)  because  a,  and  b,  are  always  positive. 


W  -• 


& 


Comparison  of  given  by  Eq.(5S)  with  in  Eq.(45) 
shows  that  Ru  is  also  biased.  However,  a  positive  bias  as  with  Ry 
may  be  less  detrimental  to  the  performance  of  the  filtering  process 
than  the  negative  bias  of  R\  because  the  range  goes  to  zero  in  the 
homing  missile  problem. 


In  order  to  prove  that  6(0)>0  the  likelihood  function  L 

is  wntten  as  follows'. 


L  =  ~  a  (0)(« -ROiR-Ri) 


& 

5s 


:fS? 

'JvJ 

n 


ft  Vj 

ft 


6  CONCLUSIONS 

In  this  study,  the  problem  of  obtaining  estimates  of  the 
states  of  a  linear  system  for  the  case  where  the  measurement 
function  is  nonlinear  has  been  considered.  A  new  maximum 
likelihood  filter  has  been  formulated  based  on  the  Taylor  series 
expansion  of  the  posteriori  conditional  probability  density  function 
around  us  mode.  The  performance  of  the  resulting  assumed  density 
filter  is  analysed  by  tracking  a  manuevering  target  with  signals  from 
a  passive  sensor  for  which  only  angle  information  is  available.  The 
numencal  results  show  that  the  error  histones  of  the  new  assumed 
density  filter  are  better  than  that  of  the  widely-used  extended 
Kalman  filter  for  initial  conditions  which  are  off-nominal.  The 
assumed  density  filter  is,  however,  negatively  biased  with  respecnve 
to  range. 

The  numencal  results  are  corroborated  by  approximate 
analytical  expressions  for  the  conditional  and  unconditional 
estimates  of  the  mean  and  maximum  likelihood  range. 


APPENDIX  A 

This  appendix  deals  with  some  derivations  that  are  used 


in  Section  V. 


1.  Proofs  To  Show  That  0(9)>O  and6(9)>0 


Consider  the  likelihood  funcnon  L  in  terms  of  x  and  y. 


It  is  given  by 

L  =  7  (x-x.y-y)S  x  ~- 
2  y~y 


after  dropping  the  subscripts  for  convenience.  In  Eq.(A.l),  the 
matrix  S,  being  the  inverse  of  the  apnon  curvature  at  the  maximum 
of  the  CPDF  is  positive.  Given  the  quadratic  form  of  the  right  hand 
side  of  Eq.(A.l),  it  can  be  concluded  that  L  is  always  greater  than  or 
equal  to  zero.  In  terms  of  R  and  9  .the  likelihood  function  can  be 
expressed  as 

L  =  (R cosB-x./?  sin0-v )  5  Pcos®~i  (A  2) 

2  '  /csin0-y  v 

where  R  =(z2+y2)‘ 2  and  9  =  tan  '(^).  The  right  hand  side  of 

x 

Eq  (A. 2)  is  expanded  to  yield 


L  =  -  (a(O)R2  ~2b(W  +  c  ) 


<*(0)  =  r  ucos;0  +  2s  i;cos0sin9  +  J22sin20  (A. 4) 

6(0)  =  cos0(s  ,,x+s  rJ )  +  sin0(s  ^u+iiiy)  (A. 5) 

and 

c  =  r  ||.r2  +  2r  u  xy  +  s22y2  (A. 6) 

Note  that  a( 9)  from  Eq(A4)  can  be  wntten  in  the 
following  form: 


•Sl"e>5  [sint. 


0(0) = (COS0 


Since  cos0  and  smO  cannot  both  be  zero  at  the  same  time,  0(9)>O. 
By  similar  arguments,  it  can  be  shown  that  c>0. 


Since  both  a(9)  and  c  are  positive,  R  [  and  R :  are  such  that  thev 
shcjld  satisfy  one  of  the  following  conditions: 

R  i  =  0  R  :<0  (All) 

/?,<0  R  2  =  0  (A. 12) 

R  i  <  0  R  2<0  (A. 13) 

R,>0  R2>  0.  (A. 14) 

In  addition  to  the  restrictions  on  L,  a,  and  c,  note  that  in  addition 
R>0  This  means  that  the  value  of  R  corresponding  to  the 
maximum  value  of  L  is  also  equal  or  greater  than  zero.  Therefore, 
the  only  possible  values  are  given  by  Eq  (A.  14)  .  In  order  to  satisfy 
Eq.(A.14),  it  can  be  observed  from  Eq.(A.9)  that  6(0)>O. 

2.  Approximations  Of  p  (z,/Z,_ ,) 

The  aprioi  PDF  of  z  is  given  by 

p(z,/Z,_|)  =  C|,(T| +T2|  (A. 15) 


i 

T  i  7  fci2os>)  i  ,  , 


Cu  =  ^F\\exp{~\c‘)  (A18) 

and  t>  =  z,-ul.  The  integration  of  the  term  T |  is  considered  first. 
The  term  exp(-— u,2V~l)  is  maximum  at  u,  =0  and  becomes  very 

small  rapidly  for  any  non  zero  values  of  u,  because  V  is  assumed  to 
be  very  small.  Note  that  for  all  values  of  0,  a,(0)  is  bounded.  Under 
these  conditions,  an  approximation  to  p(z,7Z,_j)  is  sought  using 
Laplace’s  method  (17).  Laplace's  method  involves  the  expansion  of 
the  term  multiplying  the  dominating  exponential  term  in  a  power 
series  and  the  evaluation  of  the  resulung  series  of  integrals  term  by 
term  to  any  desired  accuracy.  In  order  to  use  the  method  the  right 
hand  side  of  Eq.(A.16)  is  expanded  in  a  Taylor  series  about  u,=0. 
The  result  is 


exp(-U2ufV  x)du , 


where  g(0)  =  - -  and  primes  denote  partial  difterentiation  with 

0,  (0) 

respect  to  4  All  functions  of  6  are  evaluated  at  0  =  z,  Evaluation 


(A.23) 


of  the  integrals  in  Eq  ^A'  IS)  is  carried  out  to  yield 

Ti=SW  +  g"m'  +0(V'J).  (A. 20) 

evaluated  at  t>=:,  Since  V  is  small,  the  terms  containing  V  are 
neglected  and  the  approximated  expression  of  T  |  is  given  by 


T  i  =  g(0)  o 


In  evaluating  T 2,  it  is  recognised  that  the  term 
multiplying  exp  J-lOu.'l'-1)  also  has  an  exponential  term. 

b,~(  o) 

However,  exp(— -  )  is  bounded  for  all  values  of  0  and  it  can  be 

2a,  (0) 

expanded  in  senes.  Each  term  in  the  expansion  can  be  multiplied 
.  u  t,,m 

with  — — —  and  the  resulting  integrals  can  be  evaluated  as  in  the 

a,  (<M 

b,(<t>)  b,~(0) 

case  of  T  t.  In  other  words,  the  term  - exp  ( - — )  can  be 

a,32(0)  2a, (Q) 

expanded  in  a  Taylor  senes  about  u,  =  0  and  the  resulting 
expression  for  T2  is 

7 1  =  “(^^1)12  I l(<!>)+g  1  '«>)“.+*  i”(<W“iJ+— I 


exp(~V2u2V  ')  du, 


b,(Q)  ft,2(0) 

1:-  exP(TTZ~r)  (A-23) 

a,  (0)  2a,  (0) 

and  the  pnmes  denote  partial  differentiation  with  respect  to  0.  All 
functions  of  0  are  evaluated  at  0  =  z, .  Evaluation  of  the  integrals  in 
Eq.(A.22)  yield 

7"2  =  St(0)  +  grv  +0(VZ).  (A. 24) 

As  before  with  T [,  the  second  term  is  proportional  to  V,  the  third 
term  is  proportional  to  V2  and  so  on.  Neglecting  the  terms 
containing  V,  T2  can  be  approximated  as 


T 2  ~  «i(0) 


I  6,  (A) 

J«=t,  a  >2^) 


ft.  (A)  ft,‘U,) 

},^(z,)eXP{  2a, (z,)1 


By  substitution  of  Eq.(A,21)  and  (A.25)  into  Eq.(A.15)  gives  an 
approximation  for p(z,/Z,_,)  is  obtained  as 


1  ft.(A)  ft,2(r,) 


3  Arguments  To  Show  That  - — —  Is  "Usually"  Greater  Than 

-  a,(0)  - - - 

Unity  At  The  Maximum 

ft,2 

The  expression  for -  at  the  maximum  of  the  CPDF 

a, 

is  given  by 

j  |cor0m(.t  nT+r  t2y  >+sin9m(r  I2iF-*-.r  22>)12 

T  = - - -  (A-27> 

r  iiCOs20„+2a  |2cos0„sin0„+r  22sin2e„, 

after  suppressing  the  arguments  and  the  subscripts  of  a.b jc  ^  and  y 

for  convenience  In  order  to  analyse  the  magnitude  of  — ,  it  is 

a 

assumed  that  the  values  of  the  trigonometric  functions  of  the  apriori 
and  the  posteriori  estimates  of  the  bearings  are  equal.  That  is. 


cosO„  =  cosO  and  sin0„  =  sinO 


0  =  tan  (y !x  )• 

By  substituuon  of  Eq.(A.28)  into  Eq.(A.27),  an  approximation  for 

—  is  obtained  as 
a 

[ cos  0  (s  1  ,x  +  s  12 y )  +  sinO  (.r  12r  +  .5  2iv  ))2 


s  1  jcos‘0  +  2s  iisinOcosO  +  v  22sin‘0 

— 2  —  —2 

—  .V  ;  ;  V  +2.1'!  2 X  V  +  S  21V 


Without  losing  the  generalitv,  the  magnitude  of  —  can 

a 

be  analysed  by  examining  s  1 ,  c  *  At  the  beginning,  the  initial  range 
considered  in  a  typical  tracking  or  a  homing  missile  problem  is 
3000  feet  or  more  and  ,  therefore,  x  is  of  the  order  of  10'  Though 
the  initial  value  of  the  covariance  is  assumed  to  be  high,  thereby, 
making  in  small,  the  product  snx2  is  usually  greater  than  unity. 
As  the  problem  proceeds,  x  becomes  smaller  in  the  homing  missile 
problem.  However,  with  more  information  through  the 
measurements,  the  covanance  matrix  is  also  becomes  smaller  and, 

h2 

therefore,  in  bigger.  Consequently,  the  magnitude  of —  is 

a 

greater  than  unity. 
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Figure  3.  Range  Error  History 
(I  =  500  ft. ) 


Figure  4.  Range  Error  History 
(I  =  -500  ft.) 


Figure  5.  Range  Error  History 
(Mismatch  -=0.1) 
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A  maximum  likelihood  estimation  method  is  developed  for  applications  to  the  target  tracking  problem  based 
on  bearings-only  observations  from  a  single  observer.  The  method  involves  propagation  of  states  in  rectangular 
coordinates  in  which  ihe  linear  dynamics  permit  a  closed  form  solution.  At  the  measurement  times  the  states 
are  converted  to  a  special  polar  coordinate  system  in  which  the  measurement  is  modeled  as  linear  in  the 
transformed  state  and  updated  using  the  Kalman  methodology.  The  coordinate  transformation  is  chosen  so  that 
the  direct  transformation  of  the  maximum  likelihood  estimate  is  approximately  preserved.  The  numerical 
experiments  for  a  target-intercept  prohlem  are  presented,  which  show  that  the  performance  i  f  this  coordinate 
transformation  based  filter  is  superior  to  that  of  the  Cartesian  system  based  extended  Kalman  filter.  Approximatt 
analy  tical  results  also  corroborate  the  numerical  results. 
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Introduction 

ACTICAL  weapon  systems  require  accurate  tracking  of  ma¬ 
neuverable  vehicles  such  as  submarines  and  airplanes.  During 
the  iast  several  years  there  has  been  an  active  interest  in  the  de¬ 
velopment  of  sophisticated  filtering  algorithms  for  tracking  with 
bearings-only  observations.  Satisfactory  results  have  been  difficult 
to  obtain  using  current  mechanizable  filters  because  of  the  non¬ 
linearity  and  passive  nature  of  the  observations.  In  practical  cases, 
the  high  level  of  uncertainty  in  the  initial  states  of  the  submarine 
and  the  rapidly  accelerating  target  in  the  missile-intercept  problem 
make  accurate  estimation  of  the  states  even  more  difficult  to  ac¬ 
complish.  Considerable  research  has  been  going  on  to  improve 
existing  methods.  The  single  dominant  method  used,  in  the  ap¬ 
plications  of  the  nonlinear  filtering  methods  to  the  tracking  prob¬ 
lems.  has  been  the  extended  Kalman  filter  (EKE)1'5 

An  approach  to  achieving  a  better  nonlinear  estimator  is  to 
determine  a  state-space,  which  may  be  different  from  Cartesian 
coordinates,  for  which  improved  estimation  occurs. 611 11  The  aim 
of  this  paper  is  to  develop  a  better  and  suitable  maximum  likelihood 
filter  based  on  transformations  of  state  spaces  for  application  to  a 
target- intercept  problem.  The  formulated  polar  coordinate  filter 
(PCF)  uses  a  nonlinear  transformation  of  the  state  spaces.  How¬ 
ever.  the  choice  of  the  coordinate  systems  is  such  that  the  ap¬ 
proximate  conditional  mode  (which  represents  the  estimates  of  the 
states)  is,  except  for  one  state,  unaltered  by  the  nonlinear  trans¬ 
formations.  This  is  approximately  true  in  a  three-dimensional  co¬ 
ordinate  frame  and  exactly  true  in  a  two-dimensional  frame.  The 
development  of  the  PCF,  numerical  results  from  the  application 
of  the  PCF  to  a  homing  missile  problem,  and  analyses  arc  used 
to  explain  the  results  of  the  numerical  experiments.  In  all  analyses, 
the  Cartesian  based  EKF  has  been  included  for  comparison. 

Development  of  the  Polar  Coordinate  Filter 

The  basis  for  the  PCF  is  the  availability  of  two  convenient 
mathematical  descriptions  of  the  tracking  prohlem.  it  Cartesian 
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coordinates  are  used,  the  state  equation  is  linear  and  the  corre¬ 
sponding  measurement  equation  is  nonlinear.  It  polar  coordinates 
are  selected,  the  model  has  a  nonlinear  state  equation  bu-  a  linear 
measurement  equation.  The  idea  achind  the  PCF  is  to  exploit 
available  linearity  in  both  coordinate  systems.  This  idea  his  been 
used  in  the  past  by  Mchra *  and  Sammons'  to  solve  trae.sing  pub- 
lems.  The  algorithms  of  Mchra’’  and  Sammons'  use  the  standard 
Kalman  update  and  propagation  formulas  where  a  nonlinear  trans¬ 
formation  is  used  for  the  covariances.  In  contrast,  the  approxi¬ 
mations  used  in  the  PCF  are  made  directly  to  the  conditional 
probability  density  function  (CPDF).  and  the  approximate  con¬ 
ditional  mode  is  assumed  to  represent  the  best  estimate  ol  me  state. 
In  all  of  the  available  work  involving  transformations  of  the  state 
spaces,'' ' 8  v  it  is  tacitly  assumed  that  the  transformation  between 
the  approximate  CPDFs  preserves  the  approximate  conditional  mean. 
However,  this  is  not  the  case.  In  the  PCF.  the  coordinate  system 
is  chosen  so  that  the  approximate  conditional  mode  is  least  atfeeted 
by  the  nonlinear  transformation. 

System  Model 

The  mne-eiemer.t  state  vector  describing  the  missile-tercet  en¬ 
gagement  contains  a  three-dimensional  relative  position  vector,  a 
three  dimensional  relative  velocity  vector,  and  a  three  dimciision.il 
target  acceleration  vector  modeled  as  a  first  order  Markov  pioeess.1 
The  evolution  of  the  state  vector  in  the  inertial  frame  is  written  in 
matrix  notation  as 

,r  ••  Ft  *-  h  -i-  it  i  1 1 

where  ,v  is  die  state  vector  m  an  arbitrary  inertial  ft  ime  consisting 
ol  the  position  vector  u„.  _>i.  and  the  v  .Amo  .  nd  acee1.. ■ration 

components  are  represented  bv  ine  ox-element  'e.tor  /•  is  a 
9  v  9  matrix  ot  .onstai.ts  Ihe  nine -element  victor  e  s, inters 
(he  components  ol  the  missile  .:c>.  delation  .eitor  in  the  i ji c : t :.i  1 
axes  and  is  river,  as  />  -  |H.  0.  I).  <j,,  .  .  i).  (I  iij\  ||K- 

only  non  zero c-ir.inoncnts  mtlie  nme-e'enicn:  vi.sm  u  concs'siikl 
to  the  target  aceelcratmi.  components  u  is  a  (inu"ia,t  ecio- iiie.in 
w hue  no.se  pioeess  with  a  puwci  sp.k'r.d  deu-itv  (> 

Ihe  discrete  nonlinear  nop.  i.-miptcd  angle  ohsciv.itn  n>  a:e 
expressed  m  issta.  gul.ir  count. nates  as 

:  i,  tan  A,  iA  i  >  v , 
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"here  e  and  :■  .ire  the  measurements.  e , ,  and  v-  are  Gaussian 
zero-mean  sequences  ot  random  variables  with  a  variance  l'.  and 
subscript  i  denotes  the  time  at  which  the  measurement  is  made. 


Transformations  of  the  Xpproximute  Conditional  Probability 
Dcnsitv  functions 


Since  the  system  dvnanucs  are  linear,  its  approximate  condi¬ 
tional  mean  and  mode  are  propagated  trom  stage  w  -  1  t  to  state 
t  bv 


i  -  ihu.  i 


cNr.  ri/xridr 


where  <  t»i.  i  1 1  is  the  state  transition  matrix,  ft  is  the  a  prion 
conditional  mode  at  ..  civet!  /  .  the  measurement  historv  up  to 

<i  —  It.  and  i  1  is  the  a  [’"'tenon  ^ondittorai  mode  at  i  eiven  / 

I  he  propagation  eiiu.it/  ti  or  :ne  ipr'roxmi.i'c  a  prion  state  error 
covanar.sC'  tor  i  ,s  riven  ”v 


where  V  >  the  known  approximate  a  posteriori  conditional  co- 
variance  at  :  -  l'  P  uwis.ites  tne  inverse  ot  the  curvature  of 
the  CPDF  around  the  mode  Iheretote.  he  tore  a  measurement 
update,  tne  approximate  a  prion  t  PDF  ot  i  is  assumed  to  be  of 
the  tortn 


~  c  exp  1  -  1 1  2 )  n,r/>,  1  <>  | 


where  <r  =  |t«  -  T,  .  v  -  \H  .  .  v,r  -  vr|/.  and 

V»  .  ~R  .  and  T  are  dot t tied  to  be  the  a  priori  modes  of  .  \K  , 
:*  .  and  v  .  respectively  t  ,  is  a  normalizing  constant. 

The  transtorntation  from  the  Cartesian  coordinate  system  to  a 
polar  coordinate  svstem  is  riven  in  a  functional  form  bv  \  -  um 
where  v  includes  the  two  measurement  functions  H  and  6.  The 
choice  ot  the  other  states  is  dictated  by  the  need  to  maintain  tat 
least  approximately  i  the  direct  transtorntation  ot  the  maximum 
likelihood  estimates  between  the  CPDFs  ot  i  and  v.  In  order  to 
achieve  this,  one  of  the  states  in  v  is  the  cube  of  the  ranee  R1  and 
the  other  is  s.  The  transformed  state  y  is  given  by  v  3|/?3.  H,  d>.  v7|r 
and 


*tt)  =  (t.rj  *  y»  +■  r#l';,  tan  U\K  k„K 


tan  'l  C/f  I Xr  -  y*)'  •).  v r  | r 


The  choice  of/??  will  be  motivated  in  discussing  the  transtormation 
ot  the  CPDF  The  inverse  transformation  from  y  to  t  is  given  by 
t  =  My)  where  r  =  Ir*.  \K-  :#■  v')C  and 


hi  \ )  =  <R  3  1  cos  d>  cos  //.  /?.)'’  cos  d>  sind. 


The  CPDF  of  v,  can  ho  obtained  from  that  of  i,  as1" 


/>(>,  ,)  =  p(x,  Z,  .  |)  J 


where  J  “ '  .  the  determinant  of  the  Jacobian  of  the  transformation, 
is  found  to  be  1/3  cos <bt.  With  this  value  of  J  " 1  .  the  CPDF  of 
y,  can  be  written  in  terms  of  R 3,.  f), .  <t>:,  and  v,  as 


p(v,  Z,_,)  =  'A  cosd>,<  ,  cxp|  -  '/-( [irp]  /J  j  ] 


3  ’/<  cos  tb,  exp |  -/(  v,)| 


/J,  (/??  cos  d>,  cos  ft  —  \R  .  /?3  cos  A  suit/  —  v  H 


Ri  sin  <i  -  :H  .  tv  -  cl']' 


If  the  determinant  of  the  Jacobian  of  the  transtormation  between 
the  i  and  \  systems  is  constant,  the  modes  ot  the  approximate 
CPDFs  of  both  systems  can  be  related  by  the  transtormation 
v.  =  ci  v. ).  Since  the  determinant  of  the  Jacobian  is  not  constant, 
approximations  to  the  conditional  modes  of  the  \  system  need  to 
he  determined.  Since  cos A  multiplies  the  exponentials  in  Eq.  (9). 
the  approximation  to  the  conditional  mode  for  d>  is  of  concern. 
The  approximate  conditional  mode  <5,  is  obtained  hv  setting  the 
partial  derivative  ot  p t  v,  7.  , ).  with  respect  to  d>, .  to  zero  as 


|  i'pi  \  7  1 1 J  ( ./ <3>  i  -  t  -  tan  <5.  -  (3,f  P)  r\fn  y.  7,  ,) 


where  r  3  <ifj  in. I  he  usual  Kalman  update  formulas  for  esti¬ 
mation  are  obtained  it  tand>,  were  not  present  or  neglected.  In  the 
homing  missile  problem,  which  motivates  our  work,  the  initial 
launch  geometry  is  assumed  to  be  coplanar  in  the  .v-v  plane,  which 
means  that  the  initial  value  of  d>  is  zero.  Throughout  a  typical 
engagement,  the  magnitude  of  d>,  is  not  greater  than  45  deg  and 
the  magnitude  of  tan  6,  is  less  than  unity.  In  comparison  with  the 
other  terms  in  Eq.  1 10).  which  contain  terms  of  the  order  of  the 
range  t/?3  ‘  =  R)  or  higher.  tand>,  is  negligible.  With  this  ap¬ 
proximation.  the  approximate  conditional  modes  of  pi  y,  Z,.\)  are 
obtained  as  v,  =  gi.v, ). 

This  particular  choice  of  transformation  variables  from  Eq.  161 
approximately  preserves  the  conditional  modes  of  the  initial  and 
the  transformed  approximate  CPDFs.  There  are  some  examples  in 
the  literature1  "  where  transformations  of  the  variables  of  in¬ 
terest  have  been  used  to  form  new  state  spaces  where  the  EKF  has 
performed  better.  In  all  of  these  sets  of  state  spaces  the  assumption 
of  a  Gaussian  form  for  the  CPDF  does  not  allow  the  nonlinear 
transformations  to  preserve  the  approximate  conditional  means 
(which  are  used  to  represent  the  estimate  of  the  states)  in  the 
transformed  coordinate  systems. 

If  the  CPDF  can  be  approximated  by  a  Gaussian  form  for  /??, . 

.  6,.  and  .v,.  then  the  states  can  be  updated  using  the  Kalman 
rule1"  at  the  measurement.  In  order  to  accomplish  this,  the  argu¬ 
ment  of  the  exponent  in  Eq.  (9)  is  expanded  in  a  Taylor  series 
about  the  approximate  conditional  mode  up  to  second  order.  In 
neglecting  the  higher-order  terms,  it  is  assumed  that  the  CPDF  is 
approximately  Gaussian,  with  the  mean  approximately  equal  to 
the  mode.  The  resulting  approximated  CPDF 


p(\,/Z,.,)  =  F:  exp|  -  Y:(S,rPr ■' 6,)) 


where  (3,  3 


y,.  c;  3  'Acos <6  T,  exp|  -/( y,)|.  and  the  i/i/ith 


clement  of  l‘r  1  is  given  by 


1  Pr  '  =  I  /<  I  -  log  cos  d>,)|  |  /m  y,  )„,  in  y,  l„  | 


Fig.  1  Launch  geometry . 
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In  Eq.  (II),  P~  1  represents  the  curvature  ot  this  CPDF  at  the  a 
prion  mode  The  presence  of  the  term  expilogcosii.)  in  Lq. 
(9)  prevents  a  simple  expression  lor  P~'.  Therefore,  its  eftect  is 
examined.  The  only  term  that  log  cos  <j>,  affects  is  (?.:  ')n.  and  it 
is  given  by 


(FT'Jjl  =  — 77  (  -log  I  cose'),)  +  /(  V,»  ! 

,l,hr  ■ 


iUtr 
=  sec  -3, 


Ri,- -'F 


(13) 


where  f  is  a  function  of  P~ 1 .  sincj,.  and  cosdi,.  Note  that  the 
term  sec:i3,  in  Eq.  ( 13)  varies  between  1  and  2  as  <5  changes  from 
zero  to  45  deg.  Compared  to  the  other  terms,  which  have  a  mag¬ 
nitude  ot  the  order  of  range,  sec -<6,  is  small  and  therefore  ne¬ 
glected.  Since  the  expression  for  P~/  in  Eq.  (II)  can  be  reduced 
to 


PrV  = 


(14) 


where  g,  =  dg(x,)!d.x  is  evaluated  at  x,  -  x, .  the  approximate 
CPDF  represented  by  Eq.  (11)  is  Gaussian. 

In  the  coordinate  systems  employed  by  Mehra9  and  Sammons,3 
y’  =  [PI,  0.  <5,  s]  where  PI  Aix#  +  yp 
formed  CPDFs  have  the  functional  form 


/>(>•; /Z,..,)  =  c(y,')  exp  (/(>;) 


(15) 


since  the  determinant  of  the  Jacobian  for  their  transformations  is 
a  function  of  Furthermore,  the  determinate  of  the  Jacob’an  is 
not  even  approximately  justifiable  as  a  constant  since  it  contains 
the  range.  When  the  argument  of  the  exponential  is  approximated 
up  to  second  order,  the  resulting  approximated  CPDF  is  of  the 
form 

p(  y' ,Z, _! )  =  E<y,')exp[  -  W2(>,'  -  y,)T  Pj, -  y,’)]  (lb) 

which  is  clearly  not  Gaussian.  Consequently  the  applications  of 
Kalman  methodology  in  their  algorithms  are  not  valid. 

Updating  the  Conditional  Density  Functions  and  Estimation  in  Polar 
Coordinates 

The  a  priori  CPDF  for  P3,,  8,.  <f>,  and  s,  given  by  Eq.  (11)  can 
be  updated  by  using  Baves’  rule.10  The  a  posteriori  CPDF,  alter 
some  manipulation,  can  be  reduced  to  a  Gaussian  CPDF 

p(y,/Z,)  =  c3exp[  —  l/2(y,  -  \t)Pp,'(y,  -  y,)l  (17) 

The  objective  is  to  update  the  conditional  mode  as  if  it  were  the 
conditional  mean  with  the  curvature  at  the  conditional  mode  used 
as  the  inverse  of  the  conditional  covariance.  The  update  equations 
used  to  process  a  measurement  arc  given  by  the  Kalman  rules  as 


y,  =  >’■  + 


PpHlV,Hzu  -  e„  :2,  -  ?>,)T 


(/’, 


Hlvr 


' nry 


(18) 


(19) 


//„  is  a  constant  vector  or  partial  derivative  of  the  measurement 
with  respect  toy,  evaluated  at  y, .  and  t  ,  is  a  normalizing  constant. 
There  is  no  further  approximation  involved  in  the  update  process. 

After  the  update  m  the  polar  coordinate  system,  the  conversion 
back  to  the  Cartesian  system  is  obtained  by  tracing  (he  si  me  steps 
and  nuking  similar  approximations  to  the  value  ot  it),,  as  belore. 
J  he  PCF  is  computed  by  the  propagation  P.qs.  (3)  and  (4),  trans¬ 
formation  F.qs.  lb)  and  (14),  Idler  update  Eqs.  1 18)  and  ( 19),  and 
transformation  hack  to  the  x  system  given  by  =  iHy  ).  The  state 
covariance  ol  the  x  system  is  obtained  ill  a  manner  similar  to  P.q. 
(14),  where 


p.  -  y.p, 


(20) 
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Numerical  Results 

A  six-degree-of-treedorn  computer  program.  1  which  simulates 
the  intercept  of  a  maneuvering  target  by  a  tunk-lo  (urn.  short- 
range.  air-to-air  homing  missile  has  been  used  to  test  the  PCF  ~nd 
the  EKF.  The  guidance  scheme  that  computes  the  command'.’  1 
missile  acceleration  is  based  on  an  "optimal''  linear  guidance  law . 1 

The  launch  geometry  used  in  this  analysis  is  described  in  ;  ig 
1 .  For  this  inertial  system,  the  Z,  axis  is  directed  toward  the  harm  s 
center,  the  X,  axis  is  aligned  parallel  to  the  missile's  initial  launch 
direction,  and  the  Y,  axis  is  chosen  to  make  the  inertial  system 
rieht  handed.  The  engagement  is  characterized  by  the  initial  con¬ 
ditions:  range.  3C00  ft:  altitude,  l().(H)u  ft;  aspect  angle  it), I.  1 20 
deg:  and  off-borcsight  angle  t(K).  0.0  dee  The  numhc  of  Monte 
Carlo  trials  is  ten.  The  range-dependent  measurement  noise  model 
and  the  state  noise  mode:  and  their  statistics  arc  the  same  as  Rcl 
12.  The  diagonal  elements  of  the  initial  state  covariances  arc  It)4  it  . 
corresponding  to  ,vf- ,  10  ft*,  to  _>s  and  to  :K .  100  ft-  -ec- 
for  the  velocity  components,  and  10  it-,  see'  m  the  target  .ucei 
eration  components.  The  off-diagonal  elements  are  zero 
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Fig.  4  Range  error  history  I  /  =  -500  ftl. 


Fig.  6  Range  error  history  (mismatch  =  I0t. 
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Fig.  5  Range  error  history  (mismatch  =  0.1). 


The  error  histories  of  the  PCF  and  the  EKF  for  various  conditions 
are  given  in  Figs.  2-8.  The  error  is  defined  as  the  difference 
between  the  magnitudes  of  the  true  and  estimated  range  vectors. 
The  magnitude  of  the  estimated  range  is  obtained  by  averaging 
the  ten  Monte  Carlo  runs.  The  estimated  range  in  the  PCF  is 
computed  as  (ft3)v’. 

The  range-error  histories  for  the  nominal  case  are  presented  in 
Fig.  2  for  the  PCF  and  the  EKF.  The  performances  are  quite 
similar.  The  range  error  history  when  the  initial  range  has  pertur¬ 
bations  of  500  ft  in  the  positive  and  negative  directions  are  given 
in  Figs.  3  and  4,  respectively.  The  PCF  clearly  out  performs  the 
EKF  Since  both  the  observer  and  target  are  constantly  maneu¬ 
vering,  the  PCF.  having  states  measurement  functions,  is  able  to 
utilize  the  information  better  than  the  EKF.  By  comparing  the 
performances  of  the  EKF  in  Figs.  3  and  4,  it  can  be  observed  that 
the  range  errors,  with  a  positive  initial  perturbation  in  range,  arc 
much  worse  than  those  with  a  negative  initial  perturbation  Thus 
the  EKF  is  biased  The  performance  of  the  PCF,  however,  is  more 
even  with  positive  and  negative  perturbations. 


Since  the  filter  does  not  know  the  actual  measurement  noise 
variance,  a  measurement  mismatch,  defined  as  the  ratio  of  the 
actual  value  to  the  assumed  measurement  covariance  in  the  filters 
is  hypothesized  in  generating  the  measurements.  The  range  errors 
of  the  simulations  with  a  measurement  mismatch  of  0. 1  is  presented 
in  Fig.  5.  The  performance  trends  of  both  filters  are  similar  to  the 
nominal  case  in  Fig.  2.  For  the  higher  mismatch  of  ten.  both  the 
PCF  and  the  EKF  are  equally  affected,  as  seen  in  Fig.  6. 

To  simulate  actual  situations,  where  uncertainties  in  both  the 
states  and  the  noise  statistics  can  occur,  experiments  are  made 
with  perturbations  to  the  initial  states  and  measurement  mis¬ 
matches.  The  range  error  history  w  ith  an  initial  state  error  of  500  ft 
and  a  measurement  mismatch  of  0.1  is  presented  in  Fig.  7.  With 
the  same  initial  error  and  a  measurement  mismatch  of  ten  the  results 
in  the  range  errors  are  given  in  Fig.  8.  It  is  clear  that  in  both  cases, 
the  PCF  has  a  better  response  to  perturbations  than  the  EKF. 

Biases  Associated  with  Different  Formulations  of  the  Extended 
Kalman  Filter 

Observe  from  the  given  range  error  histories  that  the  range 
estimate  ft,  of  the  EKF  has  a  larger  bias  than  the  range  estimate 
of  the  PCF.  This  phenomenon  can  be  explained  bv  developing  and 
comparing  the  expressions  for  the  square  of  the  a  posteriori  range 
estimate  of  the  EKF  with  that  of  the  PCF.  For  simplicity,  the 
differences  in  the  expressions  for  the  conditional  range  estimate 
are  provided  from  a  two-dimensional  tracking  problem.  In  this 
case,  the  polar  coordinate  for  the  PCF  is  (/?-,  0)  where  ft  is  the 
range.  The  posterioric  estimate  of  the  range  in  the  PCF  is  defined 
as  Rp  =  R where  ft;,  is  the  estimate  of  ft;  In  order  to  illustrate 
the  ditterences  in  the  estimates  of  range  with  a  different  polar 
coordinate  system,  the  expression  for  ft,,  the  estimated  range  from 
the  titter  with  a  transformation  from  ft.  v}  to  (ft.  0}  is  derived. 
It  is  proved  that  ft,  2  Rp  and  shown  that  ft,  2  ft,.  This  inequality 
helps  explain  the  smaller  range  biases  of  Mehra's  polar  coordinate 
filter'*  over  the  EKF  for  a  reentry  problem. 

The  Approximate  Conditional  Range  from  the  EKF 

It  is  assumed  that  =  (,ts  .  yR  )  is  the  state  vector  to  be 
estimated  w  ith  know  n  v„.  The  expressions  for  the  a  posteriori  range 
estimate  are  developed  at  the  first  measurement  where  the  known 
a  priori  conditions  for  all  the  tillers  tire  assumed  to  be  the  same. 
The  state  x„  has  a  Gaussian  probability  density  function  with  mean 
x„  and  covariance  ft,,.  The  EKF  for  the  estimation  of  i„  is  given 
by 

=  t„  +  Av„  (21) 
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Fig.  7  Range  error  history  ; mismatch  =  0.1,  /  =  500  It). 


Fig.  8  Range  error  history  (mismatch  =  10,  /  =  500  fti. 


where  Ax„  2  //'  V~ 1 v„,  Plt  is  the  a  posteriori  conditional 

covariance  matrix;  //,„  is  the  first  partial  derivative  matrix  of  the 
measurement  function  with  respect  to  the  state  evaluated  at.r„;  and 
n<,  equals  [:?  -  /t(.Y0)].  The  a  posteriori  estimate  of  the  square  of 
the  range,  A,o,  is  computed  from  the  EKF  c'timatcs 

R2,.  =  ilia  =  +  2xf,Sx0  +  A.vjAx,,  (22) 

The  Conditional  Range  Estimates  from  the  PCF 

The  transformation  equations  from  the  Cartesian  system  to  a 
new  polar  system  >•„  =  (/£.  0(.]T  is  given  by  y„  =  gtx,)  where 
g(xj  =  [(xj^  +  ys.  i,  tan'1  (»/x>„))r.  This  new  polar  coordinate 
system  is  chosen  so  that  the  Jacobian  of  the  transformation  is 
constant  without  approximations.  Therefore,  the  conditional  modes 
are  invariant  under  this  transformation. 

The  updates  of  the  state  y0  and  the  covariance  /'  at  the  first 
measurement  are  calculated  with  Eqs.  (181  and  (!9i.  Tne  partial 
derivative  of  the  measurement  function  in  the  \  and  t  svstems  can 
be  related  at  Hrt  =  UW0)i(r>\„)  ~  //,.g!  .  This  expression  for 
HPi,  and  Eq.  (20)  for  PPo  are  substituted  into  Eq.  I  18)  lor  y„  to 
yield 

v„  =  >„  f  .?f„Axfl  (23) 

The  posterioric  estimate  of  R-  is  given  by  (A'  - 


are  linear  in  the  states  is  given  by  y„  =  gix  )  where 

«?<•*„)  “  l'**.  +  >>,)'•'.  tan  ' :  (»;.**  }|r  (26) 

The  update  equation  for  the  ranee  ;.i  a  measurement  is  given 
by 

~  Ru  +  gi.A.v,  27) 

where  contains  the  elements  of  g.  related  to  'ha  R  clement, 
and  is  given  by  g,  =  |.T*  A, ,  )R  R  j.  The  square  o:  the  estimate  J 
ranee  computed  bv  this  tiller  is  obtained  tier  comparison  with  mat 
of  the  PCF. 

Hi  =  Hi  -  2R„&U  Ax„  v-  Aa-rJ  e-  At,  (2t»i 

where 


Note  that  the  determinant  of  the  matrix  gf  g,  is  zero,  therefore, 
one  eigenvalue  is  zero.  Also,  the  trace  ot  ef  g,  is  unity,  and. 
therefore,  the  second  eigenvalue  is  one  Consequently,  gj  g  can 
be  written  as 


A~o  =  R-.  +  2a, 'At,  1 24) 

The  bias  in  the  polar  formulation  can  be  demonstrated  bv  dif¬ 
ferences  in  values  of  A*  from  Eq.  (24)  and  from  R]  given  bv 
Eq. (22) 

A’,  -  A',  =  A  t'A.t,,  1 25) 

This  result  is  significant  in  the  context  of  the  iiujiicn_.il  results  ot 
Figs.  3  and  4.  The  t.ingc  error  histuries.  in  response  to  position 
perturbations  in  f  ig'.  3  and  4  show  ih.it  the  I.KF  sccm>  positively 
biased.  Since  the  analytical  results  show  that  A.  ■  A;  .  the  relative 
performance  ot  tiic  PCF  seems  less  biased  than  the  F.KF 

I  he  ( Tinditional  Range  Fslimale  of  the  filter  with  R  anil  0  as  Slates 
A  formulation  is  now  attempted  along  die  same  lines  as  Sim¬ 
mons'  polar  coordinate  tiller  I  he  transformation  ot  live  rectangular 
states  to  a  polar  system  \  J  j R  .  <i  |'  where  tne  l  iC.isuteiivnts 


where  the  columns  of  T  are  the  e'genvcctois  ot  g!  g.  Fite  exptes- 
ston  lor  R  from  Eq  1 2 S i  is  rev.ritien  with  the  expression  Ua 
gf  g.  from  Eq.  ( 30 1  By  replacing  R  with  t  t„ 


2v'A  , .  -  At 
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0  1 


CD 


Bv  subtracting  R  Irom  R..  .  given  by  to  (2Ji.  the  dilletei'ces  m 
the  range  estimates  obtained  through  tile  two  dillctciit  polar  lor 
mulalions  carl  be  show  n 
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Note  that  (Ik-  right  hand  side  of  Eq  ( 32)  is  nonpositive.  Therefore. 

R.  ~c  Ri 

Bs  Jifferencirtg  Hi  in  Eq,  (31)  from  K;  (Eq.  22)  an  important 
result  is  obtained  as 


At, 


ft! 
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H;.  -  Hi  =  A.v! 

/  -  r. 
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(33) 


The  right  hand  side  of  Eq.  (33)  in  this  ease  is  always  non-negative. 
Therefore,  the  estimate  of  the  range  R,  with  the  Cartesian  tor- 
mulation  of  the  EKF  is  usually  greater  than  the  range  R.  obtained 
with  an  R  and  4  transformation  for  x  and  v . 

Mehra'  reported  results  from  a  reentry  tracking  problem  where 
the  range  estimates  of  the  EKF  formulated  in  a  rectangular  Carte¬ 
sian  frame  are  always  more  positively  biased  than  his  polar  co¬ 
ordinate  filter.  The  inequality  R ,  S  R,  helps  to  explain  Mehra  s 
results. 

Conclusions 

The  problem  of  obtaining  estimates  of  the  relative  states  of  a 
homing  missile  with  respect  to  a  target  and  the  target  acceleration, 
using  bearings  as  the  only  measurements,  has  been  considered  A 
new  maximum  likelihood  filter  has  been  formulated.  This  polar 
coordinate  tiller  exploits  the  fact  that  the  system  dynamics  are 
linear  in  rectangular  coordinates  and  the  measurements  are  linear 
in  polar  coordinates.  The  polar  state  space  has  been  chosen  so  that 
the  direct  transformation  of  the  maximum  likelihood  estimates 
between  the  spaces  is  preserved  in  two  dimensions  and.  under 
certain  assumptions,  approximately  in  three  dimensions.  The  nu¬ 
merical  results  show  that  the  estimates  of  the  range  by  the  extended 
Kalman  filter  have  a  larger  bias  than  that  of  the  polar  coordinate 
filter.  Expressions  for  the  posteriori  estimates  of  the  range  for 
these  filters  have  been  derived  and  compared  to  corroborate  the 
numerical  results. 
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On-Line  Aircraft  State  and  Stability  Derivative  Estimation 
Using  the  Modified-Gain  Extended  Kalman  Filter 


Jason  L.  Speyer*  and  Edwin  Z.  (Truest 
University  of  Texas,  Austin,  Texas 


A  new  on-line  slate  and  parameter  identification  algorithm  called  the  modified-gain  extended  Kalman  filter 
(MGEKF)  is  applied  to  the  problem  of  on-line  state  estimation  and  identification  of  the  stability  derivatives  of  a 
F- Ill  type  of  vehicle.  The  conceptual  basis  for  the  MGEKF  is  the  existence  of  a  class  of  nonlinear  functions  that 
allow  a  universal  linearization  with  respect  to  the  measurement  function.  This  class  includes  the  problem  of 
identification  of  linear  systems.  The  previous  single-output  formulation  is  extended  to  a  multioutput  formulation 
where  the  only  available  measurements  are  acceleration  and  pitch  rate,  but  not  elevator  deflection.  The  filter 
formulation  includes  a  simplified  Dryden  wind  gust  model.  The  inclusion  of  the  wind  gust  model  results  mainly  in 
a  slowed  response  in  the  estimation  of  the  stability  derivatives  associated  with  the  acceleration  state:  estimates  of 
the  stability  derivatives  associated  with  the  pitch  rate  still  respond  very  quickly.  The  accuracy  of  the  acceleration 
stability  derivatives  depends  upon  the  amplitude  and  frequency  components  of  the  persistently  exciting  dither 
signal. 


I.  Introduction 

HE  historical  development  of  aircraft  parameter  identifi¬ 
cation  is  given  in  Refs.  1-3.  These  studies  are  designed 
primarily  for  off-line  use.  A  very  complete  study  of  recursive 
identification  schemes  for  on-line  use  is  given  in  Ref.  4. 
However,  the  usual  assumption  that  the  parameters  be  con¬ 
stant  produces  gains  that  are  asymptotically  inversely  propor¬ 
tional  to  time  and  therefore  become  vanishing  small.  These 
schemes  are  not  applicable  to  aircraft  systems  that  must 
operate  continuously  and  identify  changes  in  the  stability 
derivatives  as  the  flight  conditions  change.  One  motivation  for 
this  type  of  on-line  state  and  parameter  estimation  scheme  is 
for  use  in  adaptive  flight  control  systems. 

In  Ref.  5,  various  schemes  for  identifying  constant  system 
parameters  are  compared  on  a  common  problem.  Among 
these  schemes  is  the  extended  Kalman  filter  (EKF)  whose 
performance  is  shown  to  be  relatively  poor.  This  problem  was 
again  analyzed  in  Refs.  6  and  7,  where  a  new  estimation 
scheme  called  the  modified-gain  extended  Kalman  filter 
(MGEKF)  is  used  For  a  special  class  of  nonlinearities  of 
which  state  and  parameter  estimation  in  linear  systems  is  a 
member,  there  exists  a  universal  linearization  of  these  special 
nonhneanties  with  respect  to  the  measurement  function.  In 
order  to  obtain  nonlinearities  in  this  class,  the  observability 
coordinate  system  rather  than  the  controllability  coordinate 
frame  is  used  for  the  problem  in  Ref.  5.  The  results  given  in 
Ref.  7  indicate  a  remarkable  improvement  in  performance. 
The  MGEKF  described  in  Ref.  7  is  applied  here  to  the 
problem  of  on-line  state  estimation  and  identification  of  the 
stability  derivatives  of  an  F-lll  type  of  vehicle. 

Section  II  presents  the  definition  of  a  modifiable  nonlinear 
system  function  that  forms  the  basis  of  the  MGEKF  al¬ 
gorithm  A  simple  illustration  of  a  modifiable  nonlinearity  is 
given  before  the  general  form  of  the  MGEKF  algorithm  is 
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stated.  The  essential  features  of  this  algorithm  are  then  dis¬ 
cussed.  The  dynamic  system  model  for  the  aircraft  is  pre¬ 
sented  in  Sec.  III.  The  short-period  longitudinal  mode  of  the 
aircraft  is  expressed  in  acceleration  and  pitch  rate  states  to  be 
consistent  with  the  measurements.  In  addition,  a  first-order 
model  for  the  actuator  and  a  second-order  simplified  Dryden 
wind  gust  model  are  described.  In  Sec.  IV,  the  mechanization 
of  MGEKF  using  this  aircraft  model  is  discussed  and,  in  Sec. 
V,  the  performance  of  the  MGEKF  algorithm  using  accel¬ 
erometer  and  pitch  rate  gyros  is  presented  Conclusions  and 
recommendations  are  given  in  Sec.  VI. 

II.  The  Modified-Gain  Extended  Kalman  Filter 
Algorithm 

The  dynamic  nonlinear  system  model  used  for  combined 
state  and  parameter  estimation  is  presented  first.  The  defini¬ 
tion  of  a  modifiable  nonlinear  system  function,  used  as  the 
basis  of  the  MGEKF  algorithm,  is  stated.  Then  the  MGEKF 
algorithm  is  presented  and  its  properties  discussed. 

Dynamical  System  and  Modifiable  Nonlinearities 

The  discrete  dynamic  system  model  used  for  combined  state 
and  parameters  identification  is 

>’,*i  +  B(6,)u,  +  w,  (1) 

8,^  =  8,  +  *,  (2) 

and  the  scalar  measurement  is 

z,  =  My,  +  v,  =  +  v,  (3) 

where  y,  is  an  n-dimensionaJ  state  vector.  8 ,  is  a  vector  of 
maximal  dimension  2 n  of  unknown  parameters  representing 
the  elements  of  the  matrices  A (0,)  and  B(8, ),  A (8,)  is  an 
n  x  n  matrix,  and  B(8, )  is  an  n  vector  where  both  contain  up 
to  n  unknown  elements  represented  by  the  elements  of  8(,  u, 
is  a  known  scalar  input,  r*  is  the  scalar  measurement  func¬ 
tion.  //  is  a  known  1  Xn  measurement  matrix,  and  *, ,  »■  , 
and  t\  are  zero-mean  white  noise  sequences  with  variances  Q , . 
Q, ,  and  y, ,  respectively  The  formulation  given  here  is  for  a 
single-input /single-output  system  consistent  w  ith  the  results 
of  Ref  2  Although  the  extension  to  more  than  one  input  is 
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trivial,  the  extension  to  more  than  one  output  takes  some  summarized  as 
innovation.  This  extension  is  done  in  the  following  sections. 


The  nonlineuntv  in  this  problem  is  .4(0,)  i,. 

For  conveni- 

v) 

(10) 

ence  define 

x,  =  .v.  +  A,  ( z,  -  tlx, ) 

(11) 

x[  =  \yf.  9[] 

(4) 

A,  =  M  Hr(H\fJlr  +y,)  ‘ 

(17) 

and  the  nonlinearity  as 

.U  .  ,  =  A(  .  7, )  />,  A<  ,  v, )  -  Q, 

(13) 

,,  ,  A  f.4(0,).v,  +  B(0,)t/, 

/U)  =  [  ^ 

(5) 

/>,=(/-  A ,//)//(/  -  A,  // )  r  +  A,  y,  A/ 

(14) 

where  v,  has  maximal  dimension  of  3 n. 

Modifiable  Noniineanties 

The  notion  of  a  modifiable  nonlinearity  is  that  there  exists 
universal  linearization  of  the  function  /(.x, )  with  respect  to 
the  measurement  function  z“ . 

Definition  :  A  function  /:  Rr  -»  Rp  is  a  modifiable  nonlin¬ 
ear  system  function  if  there  exists  a  pXp  matnx  F:  R  x  Rr 
—  R  r  'p  so  that  for  any  state  x,  and  known  estimate  of  the 
state  .{,. 


where  v,  is  the  propagated  state  and  parameter  estimates.  A, 
the  modified  Kalman  gain  calculated  by  Eqs.  (12-14),  \ft .  , 
the  propagated  pseudo  error-covariance  matrix,  P j  the  up¬ 
dated  pseudo  error-covariance  matrix,  Q,  the  process  noise 
covariance  matrix  composed  of  diagonal  matrix  elements  Q, 
and  Qt .  and  y,  the  measurement  noise  covariance  matrix. 

The  functions  /(.v. )  and  F(:t,x,)  can  be  expressed  in  a 
simple  way  when  the  dimension  of  0,  is  assumed  to  be  In. 
Note  that  B(  0 )  becomes  just  the  last  n  elements  of  0  . 
Furthermore,  the  function  F  (z*,.x,)  obtained  from  a  modili- 
able  nonlinear  function  /( ,v, )  becomes 


f(xl)-f(x,)  =  F(:r.x,)(x,-x,)  (6) 

where  z.*  =  fix,. 

Note  that  F(  z  * ,  x, )( x,  -  .x, )  in  Eq.  (6)  is  a  universal  lin¬ 
earization  of  /(.x,)  with  respect  to  the  measurement  function 
z*  without  any  approximation.  Notice  that  the  known  func¬ 
tion  F(  z  * ,  x, )  =  F(  //.x, ,  ,x; )  *  F(  fix, ,  x, ),  where  the  latter 
quantity  is  the  differential  of  /  evaluated  at  x,  as  used  in  the 
linearization. 

The  noiseless  case  of  a  simple  linear  dynamical  system  with 
an  unknown  coefficient  illustrates  the  idea  of  a  modifiable 
nonlinearity.  The  system  is  represented  as 

8, -i=0,.  (7) 

where  v,  is  a  scalar  state  at  stage  i  and  0,  the  unknown 
parameter  at  stage  i.  The  nonlinearity  is  put  into  modifiable 
form  by  writing 


_  U.  f,  -0,y,  +0>;  -0,.V, 


where  /„  is  an  n  X  n  identity  matrix  and  0,„ x„  is  a  2 iiXn 
matrix  of  zeros.  In  Ref.  7.  this  matrix  is  obtained  in  the 
observability  canonical  form  where  the  unknown  parameters 
lie  in  the  last  column  of  the  A  matrix.  It  should  be  noted  that 
in  the  gain  algorithm  for  propagating  the  pseudo  error-covari¬ 
ance  matrix  [Eq.  (13)].  the  actual  measurement  is  used 
rather  than  the  measurement  function  r*  in  Eq.  (15).  Finally, 
for  use  later  when  describing  the  MGEKF  for  the  aircraft 
application 

/(.;,)  =  F(0..v,).x,  (16) 


The  key  to  applying  the  MGEKF  to  the  parameter  estima¬ 
tion  problem  is  to  ensure  that  those  unknown  parameters 
being  identified  enter  the  dynamic  equation  so  as  to 
multiply  the  states  or  controls  that  are  directly  measured.  As 
shown  in  Ref.  7,  this  means  that  the  coordinate  frame  must  be 
chosen  carefully.  Furthermore,  the  results  given  in  Ref.  7 
apply  to  only  a  single  output  problem.  The  results  here  give  an 
example  of  how  the  MGEKF  can  be  extended  to  two  or  more 
outputs. 


=  F( :  * ,  x, )( .x,  -  Jt, )  (8)  III.  The  Aircraft  Dynamical  System 

The  linear  longitudinal  dynamics  representing  the  short 
where  x,r  =  [ y, .  0,].  y  and  0  are  the  estimated  values  of  y  and  period  motion  are 
0,  and 

a  =  Z„a  +  Zqq  +  Zee  -  Zaac  +  ba  (17) 

(9)  q  =  Mna  +  Mqq  +  Mee  -  Maa,,  +  bq  (18) 


F(;,*..x,)=  ,(.x,-.x,)  = 


Note  that  the  estimation  error  in  Eq.  (8)  is  propagated 
without  approximations  by  a  linear  equation.  Since  the  mea¬ 
surements  are  linear,  the  update  formula  for  the  error  is  also 
linear.  In  the  noiseless  case  where  this  filter  reduces  to  a 
nonlinear  observer,  the  error  of  this  observer  is  shown  to  be 
exponentially  convergent  by  Lyapunov's  second  method/' 7  fn 
the  noise-corrupted  case  where  only  the  noisy  measurement  is 
available  and  not  the  measurement  function,  under  certain  a 
priori  unchcckable  conditions  the  MGEKF  is  shown  to  be 
exponentially  bounded  in  the  mean  square  sense.6-7 

The  MGEKF  Algorithm 

The  discrete  formulation  of  the  MGEKF  from  Ref.  7,  based 
on  the  dynamic  system  [Eqs.  (1-3)]  using  Eqs.  (4)  and  (5).  is 


where  a  is  the  total  angle  of  attack,  q  the  pitch  rate,  e  the 
elevator  deflection,  a(:  the  angle  of  attack  due  to  wind  gust,  bn 
and  bq  the  trim  biases  associated  with  the  steady-state  condi¬ 
tions  of  a  and  q,  respectively,  and  Z„,  Zq,  Ze,  M(t,  A/  ,  and 
A/,  the  aircraft  stability  derivatives. 

Transformation  of  State  Space 

Aircraft,  such  as  that  in  Ref.  8  and  the  F-lll  type,  have 
normal  acceleration  and  pitch  rate  measurements  available 
from  an  accelerometer  and  pitch  rate  gyro  Therefore,  it  is 
advantageous  to  convert  from  angle  of  attack  to  acceleration 
in  the  dynamical  representation  of  the  aircraft  for  MGEKF 
applications 

The  accelerometer  measures  the  combined  acceleration  of 
the  center  of  mass  and  the  acceleration  relative  to  the  center 
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of  mass  due  to  the  moment  arm  XJCC.  This  relationship 
between  A,  q ,  and  a  is 


a-Pi  (?-«)  +  M 

v.  here  A  is  the  normal  acceleration  (in  g ’s)  and 

/3,= 


36  Og' 


Y 

a  A 

mg 


(10) 


(20) 


where  u  is  the  aircraft  velocity  (in  ft/s),  g  the  gravitational 
acceleration,  and  XACC  the  X  distance  from  the  aircraft  c.g.  to 
the  accelerometer. 

Equations  (17-19)  can  be  used  to  derive  a  new  set  of 
dynamical  equations  using  A  and  qH  The  new  dynamical 
system  is 

A  =  D4 A  +  D.q  +  D,e  +  DGaG  +  (2,/  +  Q,;dc  +  BA  (31) 


q=  HaA  +  H  q  +  Hte  +  Hcac  +  B 


(22) 


where 


Da*Z.+ 


Si. 


D,  -  ZsQa  -  ZaI2.  +  M,Qq  - 


MSI  SI. 
a  •»  1 


HA 


$2. 


f,  A  If  _ 


,  s  *=  q,  e,G 
s-q,e,G 


PxZ„  r  =■  a,  e,  G 

B 


,  O  _  ^^2  n 

fix  ^ 


V 


Piba-p2bq 


Si. 


+  *„ 


Note  that  Zc  and  A/c  are  the  stability  derivatives  for 
acceleration  and  pitch  rate  associated  with  wind  gust  effects. 
It  can  be  seen  from  Eqs.  (17)  and  (18)  that  ZG  =  -Z„  and 
Mc  =  -A/a;  if  these  relationships  hold,  they  imply  that  D(-  ™  0 
and  //c  =  0.  Interestingly,  this  results  in  only  the  acceleration 
equation  being  directly  affected  by  the  wind  gusts  and  then 
only  by  drc.  Although  not  directly  affected  by  wind  gusts,  the 
pitch  rate  is  affected  by  the  wind  gusts  through  the  accelera¬ 
tion  term.  The  aircraft  dynamics  can  now  be  written  as 

A  OaA  +  D^q  A-  Des  4  S2re  —  SlaaG  +  BA  (33) 
q  -=  HaA  +  Hqq  +  II, e  +  £,  (24) 

Note  that  BA  and  Bq  are  biases  associated  with  A  and  q, 
respectively.  In  the  next  sections,  the  elevator  actuator  and 
wind  gust  models  are  described. 

Elevator  Dynamics 

A  measurement  of  the  elevator  deflection  is  not  available. 
The  first-order  elevator  actuator  dynamics,  which  determine 
the  actual  position  of  the  elevator  in  response  to  an  elevator 
command  e  ,  are  assumed  to  be  of  the  form 


//,e+  ff.e, 


(25) 


where  //,  and  I!  reflect  the  dominant  dynamical  characteris¬ 
tic  of  the  elevator  actuator  As  in  Ref  8,  the  actuator  d.namic 
coefficients  arc  assumed  to  remain  constant",  therefore.  //,  .md 
//j  need  not  be  estimated  However,  since  the  actuui  elevator 
deflection  is  not  measured,  it  mu-t  be  estimated  on-line. 


Wind  Gust  Dynamics 

The  wind  gust  model  is  also  included  in  the  formulation. 
Two  commonly  accepted  wind  gust  models  used  in  the  analy¬ 
sis  of  aircraft  motion  are  the  Drvden  and  the  von  Karman 
models.9  For  estimation  purposes,  a  simplified  Drvden 
model9’1'’  winch  compares  very  well  with  the  Drvden  model,  is 
used.  The  simplified  Dryden  wind  gust  model  is 


iic  =  K^a,;  +  K.ac  +  <*y;(  t ) 


(26) 


where  u>(7  is  a  zero-mean  white  noise  process  with  spectral 
density  K-lQ.  and 


G»-2 


C,  o2I.„. 

C\  u 


r  i+2? 

U:  ,.4/j  - 


C,  = 


(l4/l)4/) 


b 
2  L 


L  =  L 


_h_ 
h  +  hn 


U" 


AA  =  -C2 


where  h  is  the  altitude  (in  ft),  h,,  =  2500  ft,  L „  =  2000  ft,  b 
the  wing  span  (in  ft),  u  the  aircraft  velocity  (in  ft/s),  and  o 
the  rms  gust  velocity  (in  ft/s).  By  letting  it,  =  a(:  and  a:  =  ac, 
the  following  set  of  linear  equations  is  obtained  from  Eq.  (26): 

*,-«2  (27) 


,  =  ff,a,  +  K:a:,  +  A',uf;(  t) 


(28) 


Augmented  .State  Variables  and  Modified  N’onlinearities 

If  Eqs.  (27),  (28),  and  (25)  are  augmented  to  the  dynamics 
of  Eqs.  (23)  and  (24),  then  the  dynamical  system  is  expanded 
to  fifth  order  in  the  states  A,  q.  c,  a,,  and  If  in  addition, 
the  constant  biases  B4  and  Bq  are  augmented  as  states,  then 
we  obtain  a  seventh-order  system. 

However,  the  system  dynamics  are  not  modifiable  since  the 
unmeasured  state  e  multiplies  some  of  the  parameters  to  be 
estimated.  However,  the  system  dynamics  can  be  made  mod¬ 
ifiable  by  replacing  state  e  with  two  new  states  defined  as 


Xx  =  Dte,  ,Yj  A  Hte 


(29) 


where  £>.  A  De-  II]Sle.  Note  that  e(,  the  commanded  elevator 
position,  is  available  and  is  assumed  to  be  known  perfectly. 
This  results  in  the  following  modifiable  dynamical  system: 
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where  0,.  =  //.£_.  Note  that  bv  this  device  the  parameters  D 
and  //.  are  now  multiplied  by  e  .  which  is  a  known  input. 
Thus,  with  acceleration  and  pitch  rate  measurements,  the 
nonlinearities  are  modifiable  noniineanties.  As  stated  in  Ref 
7.  it  is  important  that  the  system  be  observable.  By  using  the 
observability  test  in  Ref.  11,  it  can  be  shown  that  this  svstem 
is  observ  able. 

Transformation  from  Continuous  to  Discrete  Form 

Since  the  discrete  version  of  the  MGEKF  will  be  imple¬ 
mented,  the  aircraft  dynamical  equations  must  be  transformed 
into  discrete  form.  By  assuming  the  sample  time  Ar  to  be 
sufficiently  small,  the  discrete  dynamical  equations  are  ap¬ 
proximately 


.4 

'  d4 

D. 

A 

1 

0 

0 

R  i 

Ar 

0  " 

<7 

II. 
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Dt  =  DjAr  -c  1,  D,=Z>,Ar 
£>,  =  DeSt,  II  t  =  HA±t 

®  W,  A  r  +  1 ,  //,,«//,  A  r 

S/l=H2[Q'(DAi  +  \)+Dtl]^- 

Note  that  .tt  =  De  e ,  and  x2  =  II,  e,  and  that  the  following 
parameters  are  assumed  known  at  each  time  step  r: 

Cx~e~H^,  C2  =  ^(l-C,) 

Ual=^4^.  Ual  =  Rir1c 

R^K^t,  /G  =  K,Ar+l,  R,=  -flaAr 

„  KA±t)2  „  (2  +  K\Ar)  K, 

K“~  2  ’  2 

where  7(,;  is  a  zero-mean  noise  sequence  with  variance  (7,/Ar. 
In  making  the  discrete  approximation,  the  exact  discrete  form 
is  used  when  convenient:  otherwise,  the  above  is  the  first  term 
of  a  Taylor  senes  in  A r. 

IV.  Implementing  the  MGEKF 

The  MGEKF  algorithm,  formulated  in  Eqs.  (10-14),  is  to 
be  applied  to  the  problem  of  estimating  the  aircraft  states  and 
parameters.  This  algorithm  is  extended  from  a  single  output  to 
the  output  of  acceleration  and  pitch  rate.  We  have  already 
shown  that  the  dynamic  nonlinearities  are  modifiable  nonlin¬ 


earities.  The  acceleration  and  pitch  rate  measurements  are 

-4,  =  A,  +  ,V  =  1,  ^  (32) 

where  rt  and  v  are  constant  standard  deviations  associated 
with  the  measurement  noise  of  ,4  and  q,  respectively,  so  that 


y,  = 


«■«-  0 

0  c; 


(3?) 


The  formulation  of  the  MGEKF  algonthm  requires  that  the 
matrices  F{ !!,  and  fix,)  be  formed  from  the  aircraft 
dynamic  svstem  [ Eq  (31))  and  measurements  [Eq.  (32)].  Since 
we  did  not  include  the  biases  (fi,,  B ,)  in  our  linear  simula¬ 
tion.  thev  arc  not  included  in  the  state  space  defined  now  as 

,v,r  =  [  .4 .  </,  ,V, ,  Y,  .  a, ,  a , ,  D< ,  Dq ,  llA ,  II, ,  S4 ,  S, .  D, .  H,  ] 

(34) 

(35) 

(36) 


and  the  measurement  is  defined  as  the  two-vector 


Then 


-  1-4- 


ESTIMATED  PARAMETERS  DY1=0.003  g's 

ACTUAL  PARAMETERS  DY2  =  0.004  deg/sec 


TIME  (SEC) 

Fig.  I  Parameter  tracking  with  a  step  change  in  flight  conditions. 
WG  =  1  ft/s,  accurate  instruments,  and  low-amplitude  dither  signal. 


•ESTIMATED  PARAMETERS  DY1=0.003  g's 
ACTUAL  PARAMETERS  DY2  =  0.004  deg/sec 


TIME (SEC) 

Fig.  2  Parameter  tracking  with  a  ramp  change  in  flight  conditions. 
WG  =  1  ft/s,  accurate  instruments,  and  low-amplitude  dither  signal. 
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Note  the  placement  of  the  measurements  in  the  error  dy¬ 
namics  matrix  F(z:,  ,v,>  [see  Eqs.  (6)  and  del).  Lath  measure¬ 
ment  multiplies  the  corresponding  parameter  error  for  that 
particular  state.  For  example,  the  acceleration  measurement 
:A  multiplies  its  corresponding,  parameter  error  (  D4  -  .0,). 
No  parameter  associated  with  any  of  the  states  exccDt  acceler¬ 
ation  and  pitch  rate  is  identified.  Since  input  t\.  is  multiplied 
by  C2  in  the  rows  associateo  with  stales  .V.  .end  A,,  the 
estimated  parameters  are  now  D,  and  II t  rather  than  O-Z?, 
and  Ch  iir . 

The  Process  Noise  Covariance  Matrix  and  Filter  Tuning 

Process  noise  is  assumed  for  the  parameters  [see  Fq.  (21]  in 
order  to  keep  the  filter  gains  associated  with  the  parameters 
from  going  to  zero.  The  values  of  the  process  noise  variance 
for  the  parameters  are  chosen  try  tuning  the  filter  to  obtain 
best  performance.  The  discrete  stochastic  equation  is 

T-i  =  F(0.x)x,  +  w,  (39) 

where  vv,  is  composed  of  the  si.x-dmien stone  1  sector  »■,  associ¬ 
ated  with  the  states  and  the  eight-dimensional  vector  if. 
associated  with  the  parameters.  Note  that  »•,{/).  u  -At),  u\(t), 
and  wy,(i)  are  elements  of  if,  that  hare  the  state-dependent 
forms 

H  j(')  “  (C,e  -r  C-ty.  ),*,,(/)  -  D,.u,.( ;) 
w4(  i)  =  (  Cze  +  C-c,  ) , *ru ( i )  -  //.tv,  ( i ) 

Ms(‘)  =  <*>„t0)-  »<,(') 

while  the  remaining  noise  processes  are  assumed  independent. 

The  process  noise  covariance  matrix  Q,  —  E\  v  u '  ]  has 
values  along  its  diagonal  and  the  only  nonzero  oT-diagonal 
elements  are 

Qv*-Q*.:~l>,HeQyi 

Qys-Qt.i~R4fiiQ.At 

Q i.ii  =  Q  is.!  =  i'Qu.n 

Ql  14  =  Qlt  4  ~  14.14 

i  =  C,e  d  C2e, 

"here  the  dependence  on  the  time  stage  1  has  been  sup¬ 
pressed.  Since  some  of  the  olf-diagonal  elements  of  (/,  depend 
on  the  parameters  or  states,  they  must  be  approxim  ited  The 
adaptive  form  of  the  process  noise  covariance  matrix  sug¬ 
gested  in  Ref.  -  uses  the  current  estimates  of  the  parameters 
to  adapt  (Jr 

The  first  choice  for  the  process  noise  standard  deviations 
associated  with  the  states  other  than  wind  mists  was  to  set 
them  to  zero,  since  no  modeled  proee.s  none,  exist  on  any  of 
the  states  except  wind  gusts  However,  to  enhance  the  MGEKF 
performance  in  the  presence  of  modeling  ir.iccur.vies.  -trail 
power  spectral  densities  are  assumed  Although  the  same 
reasoning  can  he  usd  in  choosing  the  stand. ml  deviations 


c  0  0  0 

ii  e  o  0 

1*  (I  C'.e,  0 

t)  0  0  '  C'.e 

0  0  o  0 
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associated  with  the  estimated  parameters,  it  was  disco' «vd 
experimentally  that  setting  the  standard  deviati  -n  of  the 
parameters  to  about  the  magnitude  of  the  di.Tciem. ,■  h,  tween 
the  minimum  and  maximum  values  of  the  parameters  oxer  the 
flight  envelope  yielded  the  best  remits. 

V.  Computer  Simulation  and  Control  Sv  stem 
Design  Consideration 

The  aircraft  simulation  routine  provides  the  MGEKF 
routine  with  acceleration  A.  pitch  rate  a.  commanded  elevator 
position  c, .  forward  velocity  u.  and  altitude  h.  The  MGEKF 
uses  the  measurements  of  A.  q,  and  e,  direct. v,  while  the 
wind  gust  model  uses  u  and  h  to  calculate  wir.u  gust  coeffi¬ 
cients  in  /'( r, .  .xy). 

The  simulation  uses  an  exact  discrete  form  of  the  coiitum- 
ous  dynamic  equations.  The  trim  biases  are  not  included.  The 
dynamic  system  is  persistently  excited  by  an  oscillatory  dither 
of  the  elevator  input  in  order  to  estimate  the  parameters.  The 
dither  signal  maintains  an  adequate  signal-to  noise  ratio  in  the 
absence  of  pilot  input,  enabling  the  niter  to  differentiate 
between  the  response  of  the  dynamical  system  and  noise  on 
the  system.4 

Shape,  amplitude,  and  frequency  arc  the  three  major  aspects 
of  the  dither  signal  important  to  the  performance  of  the 
MGEKF.  A  sinusoidal  dither  signal  composed  of  three  fre¬ 
quencies  gave  good  results.  Two  frequencies  are  at  the  high 
(4.3  rad/s)  and  low  <1.8  rad/s)  ends  of  the  expected  short- 
period  frequency  of  the  aircraft  over  the  flight  envelope.  The 
third  is  at  the  frequency  of  the  higher-order  actuator  (20 
rad/s)  because  experimentation  indicated  that  the  parameters 
associated  with  the  control  (St,  S.r  Dc .  and  I!r)  .ire  more 
easily  identified  if  a  frequency  corresponding  to  the  natural 
frequency  of  the  actuator  is  included  in  the  dither.  U-c  of 
these  frequencies  results  in  the  improved  performance  of  the 
MGEKF.  which  allows  a  decrease  in  the  amplitude  ol  the 
dither  signal  while  Mill  maintaining  performance. 

In  order  to  determine  the  effect  of  sensor  accuracy  on  the 
performance  of  the  filter,  the  accelerometer  noise  standard 
deviations  arc  alternately  set  to  0.(203  and  0.03  g.  The  pitch 
rate  gyro  noise  standard  deviations  are  alternately  set  to  0.004 
and  0.04  deg.'s  Finally,  three  levels  of  clear  air  turbulence  xvere 
considered:  a  =  WG~  I,  5,  and  15  ft/s. 

The  objective  of  our  numerical  experiments  are  to  show 
how  the  MGEKF  tracks  the  -tates  and  stability  deviations 
through  a  change  in  flight  conditions  from  an  aiumde  of 
15.00  !  It  and  a  Mach  number  of  0  6  to  an  altitude  of  500  ft 
and  a  Mach  number  of  0  (A).  In  the  linear  simulation,  the 
transition  from  one  ’light  condition  to  the  other  was  formed 
either  as  .1  s’cp  or  j  ranw.  The  ramp  is  essentially  .1  linear 
inter).. ilatio:'.  between  the  parameters  associated  with  c.uh 
flight  condition  over  a  6  .  period.  Since  onlv  0  „•  dithc:  signal 
excites  the  aircraft  derma  the  transition,  it  i>  cxpciled  that  the 
performance  shown  here  is  soiiiewh.it  conservative,  since 
changes  in  Might  v  editions  require  control  aip-ats  ih  .t  will 
generate  additional  acceleration  and  pivh  rate 

The  I1n.1l  , ond-tions  of  a  j ■  ’  s  ,  .mergence  run  .ire  used  .is 
the  tni'ial  condition-  for  cash  run  This  rc|  r  wents  s.v< • 
'■'ill  1'  starting  condition  A  ramp-  change  from  one  aircraft 
I  light  con  Jit. on  to  jn.  tlier  occurs  between  4  0  and  1’  •  1 )  s.  «  ;th 
no  eh  inees  made  f ri -in  bill  -  u.  the  end  of  the  run  at  Mi'  s 
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In  the  following  figures,  solid  lines  represent  the  estimated 
parameter  and  the  dashed  lines  represent  the  actual  parame¬ 
ter.  Also  printed  on  these  figures  are  DYI  and  DY2,  the 
standard  deviations  of  the  measurement  noise  on  the  accel¬ 
erometer  and  the  pitch  rate  gyro,  respectively.  \VG  is  the  rms 
value  of  the  wind  gusts,  which  indicates  the  process  noise  on 
the  system  The  sample  frequency  is  100  Hz. 

Only  the  parameters  D ,  and  //,  are  used  to  compare  results 
of  variations  in  system  design  and  model  design,  because  Dr  is 
a  good  indicator  of  the  tracking  characteristics  of  the  other 
parameters  associated  with  the  acceleration  equations  and  //, 
is  a  good  indicator  of  the  pitch  rate  parameters. 

A  step  jump  in  flight  conditions  shown  in  Fig.  1  indicates 
the  step  response  characteristics  of  the  MGEKF.  The  slow 
response  in  D,  is  characteristic  of  the  parameters  associated 
with  the  acceleration  state.  This  is  due  to  the  wind  accelera¬ 
tion  term  in  the  acceleration  equation.  In  contrast,  note  the 
rapid  response  of  the  estimated  parameter  He. 

The  Dryden  wind  gust  model  is  obtained  empirically  from 
many  atmospheric  studies10;  therefore,  the  wind  gust  char¬ 
acteristics  of  the  real  atmosphere  will  not  exactly  correspond 
to  the  assumed  wind  gust  model  in  Eqs.  (22)  and  (28).  This 
fact  must  be  considered  when  analyzing  the  performance  of 
the  MGEHF,  since  its  performance  may  suffer  if  the  actual 


- ESTIMATED  PARAMETERS  DY1  =  0.03  g's 

. ACTUAL  PARAMETERS  DY2  =  0.04  deg  ' sec 


TIME  (SEC) 

Fig.  5  Parameter  tracking  with  a  ramp  change  in  Might  conditions, 
WG  =  5  ft/s,  reduced  accuracy  instruments,  and  low-amplitude  dither 
signal. 
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ESTIMATED  PARAMETERS  DY1  =  0.Cn  gs 
ACTUAL  PARAMETERS  DY2  =  0.04  ci  e  p  /  s  e  c 


TIME  (SEC) 

Fig.  8  Parameter  tracking  "1111  a  ramp  change  in  Right  conditions, 
WG  =  5  f t /s  in  filter.  \VG  =  15  ft/s  in  simulation,  reduced-accuracy 
instruments,  and  low-amplitude  dither  signal. 


wind  gust  characteristics  are  mismatched.  The  effects  are 
negligible  at  WG  =  5  ft/s  when  a  run  having  a  15.000  ft 
mismatch  between  the  actual  wind  gust  dynamics  and  the 
wind  gust  model  is  compared  with  the  matched  case  witn  ail 
other  factors  being  the  same. 

For  the  same  dither  signal  and  instrument  accuracy.  Figs.  2 
and  3  show  the  effect  of  changing  clear  air  turbulene  from 
WG  =  1  to  5  ft/s,  where  the  corresponding  rms  acceleration 
over  the  20-s  flight  for  each  profile  is  0.0926  and  0.1195  g, 
respectively.  The  effect  of  reduced  instrument  accuracy  on  the 
low-amplitude  dither  signal  is  shown  in  Figs.  4  and  5.  Note 
that,  with  clear  air  turbulancc  of  WG  =  5  ft/s,  comparing 
Figs.  3  and  5  shows  that  the  effect  of  an  improved  measure¬ 
ment  device  has  little  effect  on  D,  but  has  some  effect  on 
The  effect  of  an  increase  in  dither  signal  amplitude  is  shown  in 
Fig.  6,  where  the  rms  acceleration  over  the  20-s  flight  is  0.4  g. 
Note  that  now  Figs  2  and  6  have  about  the  same  profile  ev  en 
with  reduced  instrument  accuracy.  However,  large  accelera¬ 
tions  and  pitch  rates  produce  pilot  and  passenger  discomfort. 
Finally.  Figs.  7  and  8  show  the  mismatched  performance  of 
the  filter  of  Fig.  5  (WG  =  5  ft/s)  with  dear  air  turbulence  in 
the  simulation  is  lower  (WG  =  1  ft/s)  and  higher  (WG  =  15 
ft/s)  than  WG  =  5  ft/s,  respectively.  Note  that  R ,  in  Eq. 
(31),  which  can  be  determined  from  the  estimated  parameters, 
need  be  chosen  onlv  at  some  nominal  value  since  WG  is  never 
known.  That  is,  if  R,  a,  is  chosen  as  a  state  variable,  then  the 
wind  intensity  becomes  8,  WG  The  results  shown  are  for  a 
Monte  Carlo  average  of  10  runs. 

VI.  Conclusion 

Although  additional  investigation  of  the  modified-gain  ex¬ 
tended  Kalman  filter  (MGEKF)  is  needed,  the  results  from 


this  study  indicate  that  the  MGHKF  displays  reasonable 
performance  in  the  areas  of  convergence  characteristics,  dis¬ 
turbance  rejection,  and  response  to  system  changes.  The  slow 
response  of  the  parameter  estimation  error  associated  with  the 
accdcrat'on  equation  is  due  mostly  u  the  inclusion  of  the 
high-frequency  gust  acceler  tion  term.  Improvements  might 
be  made  by  adding  an  anglc-of-attack  meter  tiiat  measures  the 
relative  motion  between  the  aircraft  and  the  air  mass.  There¬ 
fore,  the  MGEKF  seems  well  suited  to  application  in  an 
adaptive  control  scheme.  'Die  MGEKF  can  provide  state  and 
parameter  estimates  to  a  set  of  control  laws  that  use  these 
estimates  to  adapt  to  changing  flight  conditions.  The  con¬ 
troller  should  be  designed  to  rely  most  heavily  on  the  parame¬ 
ters  associated  with  pitch  rate.  This  is  not  unreasonable  since 
//,. ,  which  is  essentially  the  change  in  moment  due  to  elevator 
deflection,  is  estimated  well  and  is  important  in  designing 
responsive  flight  control  systems. 

Acknowledgment 

This  work  was  sponsored  by  General  Dynamics,  Fort  Worth 
Division,  and  the  U.S.  Air  Force  Office  of  Scientific  Research 
under  Grant  AFOSR-84-0371. 


References 

1  Denerv,  D.G.,  "  Identification  of  System  Parameters  From  Input- 
Output  Data  With  Applications  to  Air  Vehicles,"  NASA  TN  D-6468, 
Aug.  1971. 

;Redicr.s,  H.A.,  “An  Overview  of  Parameler  Estimation  Techniques 
and  Applications  in  Aircraft  Flight  Testing."  Proceedings  of  Sym¬ 
posium  on  Ptirameicr  Estimation  Techniques  and  Applications.  NASA 
TN  D-~647.  April  1974,  pp.  1-18. 

1  IlitT.  K.W.  and  Maine.  R.E.,  “Practical  Aspects  of  Using  a  Maxi¬ 
mum  Likelihood  Estimation  Method  to  Extract  Stability  and  Control 
Derivatives  From  Flight  Data,”  NASA  IN  li  A 209,  1976 

4Liung,  L.  and  Soderstrom,  T.,  Theory  and  Practice  of  Recursive 
Identification,  Massachusetts  Institute  of  Technology  Press.  Cam¬ 
bridge,  1983. 

5Sanuis.  G  H  .  ‘Comparison  of  Six  On-Line  Identification  Algor¬ 
ithms."  A  utonialica .  Vol.  10.  1974. 

6 Song,  T  1.  and  Spc'-ci,  J  L.,  “A  Stochastic  Analysis  of  a  Modified 
Clam  Extended  Kalman  Filter  with  Applications  lo  Estimation  with 
Bearings  Or.lv  Measurements,”  IELL  Transactions  on  Automatic  Con¬ 
trol.  Vol  AC-30,  Oct.  1985. 

’Song.  T  L.  and  Speyer.  J.L.,  “The  Modified  Gain  Extended  Kal¬ 
man  Filter  and  Parameler  Identification  in  Linear  Systems,"  Auto- 
mattca.  Vo!.  22.  No.  1,  Jan.  1986.  pp.  59-75. 

"Speyer.  J  L  .  White,  J.E.,  Douglas.  R„  and  Hull,  D.G.,  "Multi- 
Input  Multi-Output  Controller  Design  for  Longitudinal  Decoupled 
Aircraft  Motion,"  Journal  of  Guidance,  Control,  ami  Oynanucs,  Vol.  7, 
Nov. -Dee.  1984,  pp.  695-702. 

‘'Chalk.  C  R„  Neal.  T.P ,  Harris,  T.M..  Pritchard.  F.E.,  and  Wood¬ 
cock.  R  J..  “Background  Information  and  Users  Guide  for  MIL- 
F-S785B  (ASG),"  AFFDL-TR-69-72,  Aug.  1969,  pp.  417-461. 

’"Holley,  W.E.  and  Bryson,  A.E.,  “Wind  Gust  Modeling  and 
Lateral  Controt  for  Automatic  Landing.”  Journal  of  Spacecraft  and 
Rockets.  Vol  14,  Feb  1977,  pp.  65-72. 

"Chen,  C  .  Introduction  lo  lanear  Systems  Theory  .  Hold.  Rinehart, 
&  Wmston.  New  York.  1970. 


IEEF  TRANSACTIONS  ON  AUTOMATIC  CONTROL.  VOL  AC-32.  NO  7,  JULY  1987 


593 


Detection  Filter  Design:  Spectral  Theory  and 
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Abstract— A  new  formulation  of  the  detection  filter  problem  is 
generated  by  assignment  of  the  closed-loop  eigenstructure  under  certain 
constraints.  Detection  filters,  which  are  actually  a  specific  class  of 
observers,  fix  the  output  error  direction  of  the  system  so  (hat  it  can  be 
associated  with  a  particular  failure  mode  and  its  known  design  failure 
direction.  The  derivation  of  detection  filters  from  an  eigensystem 
assignment  approach  permits  a  very  transparent  theory  .  The  detection 
filler  gains  and  closed-loop  eigenvectors  are  obtained  from  a  set  of 
simultaneous  equations.  Necessary  and  sufficient  conditior  s  for  the 
solution  of  these  algebraic  equations  are  determined  which  produce  a 
complete  theory  for  detection  filters. 


I.  Introduction 

THE  design  of  reliable,  fault-tolerant  control  systems  requires 
that  system  failures  be  detected  and  identified  within 
acceptable  time  limits,  such  that  the  system  feedback  is  not 
excessively  corrupted.  The  principal  tradeoff  to  be  made  in 
designing  a  redundancy  management  scheme  is  that  of  hardware 
redundancy  versus  the  complexity  and  robustness  problems  of  the 
software  for  analytic  redundancy  (i.e. .  combining  the  outputs  of 
dissimilar  devices  through  analytic  kinematic  and  dynamic  rela¬ 
tionships  to  obtain  redundancy).  A  survey  on  design  methods  for 
failure  detection  is  given  in  1 1],  Analytic  redundancy  management 
schemes  are  developed  by  forming  and  processing  failure  resid¬ 
uals.  These  residuals  are  essentially  zero  if  no  failure  occurs  and 
are  nonzero  if  a  failure  occurs.  The  residual  formation  techniques 
in  the  literature  may  be  categorized  into  two  broad  groups.  Open- 
loop  schemes  (2|  form  one  group.  These  schemes  involve  the 
construction  of  a  set  of  parity  equations  which  represent  all  of  the 
analytical  redundancies  of  a  system.  These  parity  equations  are 
simply  all  ^  the  possible  input-output  relationships  of  a  given 
linear  system.  A  generalized  parity  space  [2]  can  be  formed  from 
the  parity  equations,  and  in  the  presense  of  a  failure  the  resulting 
parity  errors  combine  to  provide  a  failure  signature  with  direc¬ 
tional  characteristics  in  addition  to  the  usual  residual  magnitude 
information.  Theoretically,  these  directional  signatures  should 
facilitate  the  failure  detection  and  identification  process.  How¬ 
ever,  the  open-loop  parity  error  characteristics  are  of  a  highly 
temporal  nature  and,  therefore,  the  directional  failure  signature  is 
not  generally  constrained.  Furthermore,  the  failure  magnitude  of 
some  or  all  of  the  parity  residuals  may  disappear  after  n  or  fewer 
sample-times  (n  is  the  dimension  of  the  state  space).  These 
problems  would  seem  to  limit  the  usefulness  of  the  open-loop 
parity  space  concepts. 

The  second  category  of  residual  formation  techniques  is  that  of 
closed-loop  schemes.  Although  any  linear  filter  residual  could  be 
processed,  one  particular  type  of  filter  produces  residuals  with 
directional  characteristics  that  can  readily  be  associated  with  some 
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known  failure  mode.  These  filters  are  known  as  detection  filters, 
but  are  actually  a  particular  class  of  observers.  Unlike  the 
directional  failure  signatures  of  the  open-loop  parity  space 
method,  detection  filters  act  in  a  closed-loop  fashion  to  fix  the 
output  direction  associated  with  plant  jind  actuator  failures  while 
restricting  sensor  failure  output  directions  to  lie  in  a  plane. 
Furthermore,  the  output  error  magnitude  never  completely 
disappears  when  a  failure  has  occurred. 

The  original  theoretical  development  in  detection  filters  was 
completed  by  1973  (3),  |4|.  The  intent  of  this  paper  is  to 
reformulate  the  detection  filter  theory  of  [3|,  (4)  as  an  eigensys¬ 
tem  assignment  problem.  The  algorithms  of  |3],  [4]  take  the 
relatively  indirect  approach  of  generating  a  cyclic  space.  The 
current  approach  produces  a  straightforward  derivation  which 
yields  a  system  of  simultaneous  linear  equations  to  be  solved  for 
the  detection  filter  gains  and  the  closed-loop  eigenvectors,  once 
the  closed-loop  eigenvalues  have  been  assigned.  The  detection 
filter  terminology  and  certain  referenced  pans  of  this  paper  are 
taken  from  [3],  [4] .  Our  results  parallel  those  in  multivariable 
control  system  design  based  on  cnoosing  the  closed-loop  eigen¬ 
structure  to  determine  a  unique  feedback  gain  matrix  [51.  Moore 
[5]  has  shown  that,  in  addition  to  the  usual  freedom  to  choose  the 
closed-loop  eigenvalues,  the  closed-loop  eigenvectors  can  be 
chosen  from  an  w -dimensional  subspace  when  there  are  m  control 
inputs.  Moore  uses  this  flexibility  in  the  choice  of  the  eigenvectors 
to  propose  a  design  scheme  for  adjusting  the  distribution  of  the 
modes  among  the  output  components  so  as  to  shape  the  response 
characteristics  of  the  system.  This  paper  demonstrates  that  similar 
eigensystem  assignment  freedoms  and  design  algorithms  exist  for 
a  particular  class  of  observers,  known  as  detection  filters,  which 
can  be  completely  defined  by  specification  of  a  set  of  closed-loop 
eigenvalues  along  with  appropriate  constraints  on  the  eigenvec¬ 
tors.  Although  there  is  a  good  deal  of  literature  on  eigenstructure 
assignment  for  both  state  and  output  feedback,  the  constraints 
imposed  here  on  the  observer  gains  require  alternate  derivations 
and  algorithms.  For  additional  detail,  see  [6], 

In  the  next  section  definitions  to  establish  notation  and  to 
introduce  basic  failure  modeling  considerations  are  presented.  In 
Section  III  the  algorithm  for  determining  the  detection  gains  and 
closed-loop  eigenvectors  is  established.  However,  this  algorithm 
assumes  that  the  eigenvalues  can  be  arbitrarily  assigned.  If  a 
certain  condition  is  not  met,  the  algorithm  must  be  further 
generalized.  This  is  the  topic  of  the  subsequent  sections.  An 
example  is  used  to  illustrate  all  the  theory  Although  the  detection 
filter  was  analyzed  in  [3]  and  [4],  the  proofs  of  the  theorems  and 
resulting  algorithms  for  determining  the  detection  gains  and 
closed-loop  eigenvalues  by  the  eigensystem  assignment  method 
are  generally  different. 

II.  System  Definition 

The  open-loop  dynamic  model  in  the  absence  of  failures  is 
given  by 

x  =  Ax  +  Bu  (I) 

where  x  is  an  n  x  1  state  vector  The  measurement  equation  in 
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the  absence  of  sensor  failures  is  written  as 

y  =  Cx  (2) 

where  v  is  an  m  x  1  measurement  vector.  The  detection  filter  is 
assumed  to  have  the  form  of  a  linear  filter  such  that 

x  =  Ax  +  Bu  +  D(y-  Car)  (3) 

where  .v  is  the  state  estimate  and  D  is  the  detection  gain.  If  the 
state  error  is  defined  as  e  a  x  -  x,  then  D  is  to  be  chosen  such 
that  the  output  error,  e  =  y  -  C.i,  has  restricted  directional 
properties  in  the  presence  of  a  failure.  The  closed-loop  dynamic 
equation  becomes  e  =  Ge  when  there  are  no  failures,  where 

G  k  A- DC.  (4) 

The  occurrence  of  a  plant  or  actuator  failure  can  usually  be 
modeled  bv  a  single  term  added  to  (1)  to  produce 

x  =  Ax  +  Bu+fn,  (5) 


direction  is  defined.  This  definition  requires  that  for  arhitrarilv 
placed  filter  poles,  the  gains  be  determined  so  that  a  failure 
direction  induces  a  unique  measurement  residual  direction.  To 
ensure  that  the  fault  directions  are  detectable,  certain  general 
assumptions  are  imposed  on  the  sy  stem.  In  later  sections,  some  of 
these  assumptions  are  removed.  The  restrictions  imposed  on  the 
system  by  the  assumptions  and  the  requirement  of  detectability 
force  groups  of  eigenvectors  to  produce  output  residual  directions 
identical  to  those  produced  by  the  various  fault  directions.  The 
number  of  eigenvectors  associated  with  each  output  residual 
direction  is  determined  by  computing  the  dimension  of  the 
associated  detection  space.  If  the  sum  of  the  dimensions  of  all  the 
detection  spaces  adds  up  to  the  dimension  of  the  state  space  (this 
property  is  referred  to  as  mutual  detectability),  then  the  filter 
eigenvalues  can  be  arbitrarily  assigned,  and  a  simple  algorithm  tor 
determining  the  detection  gains  and  the  closed-loop  eigenvectors 
is  developed.  In  Section  VI  techniques  for  making  nonmutuaily 
detectable  problems  mutually  detectable  are  developed. 

A.  Failure  Detectability 


$ 


where  /  is  the  rt  x  1  design  failure  direction  associated  with  the 
/th  plant  or  actuator  failure,  and  n,  is  generally  a  time-varying 
scalar  which  may  be  a  function  of  x(t)  or  u(t).  A  priori 
knowledge  of  u,  >s  not  required  in  the  design  of  a  detection  filter 
and  it  is  assumed  that  p,(t)  is  an  arbitrary  function  of  time. 
However,  knowledge  of  the  failure  magnitude  characteristics  may 
be  useful  for  distinguishing  between  different  failures  with  the 
same  output  directions.  For  plant  or  actuator  failures,  the  error 
system  is  rewritten  as 


(  =  G(  +/im„ 


The  detection  gain  D  will  be  determined  in  the  following  sections 
so  that  e  is  proportional  to  Cf,  in  response  to  a  failure 
corresponding  to  that  modeled  by  the  direction  /.  This  output 
direction  is  maintained  during  the  transient  (assuming  that  the 
transients  due  to  the  initial  conditions  have  settled-out  before  the 
failure  occurs)  and  steady-state  phases  of  the  error  response  due  to 
the  system  failure. 

The  occurrence  of  a  sensor  failure  can  generally  be  modeled 
with  a  single  term  added  to  (2)  as 


y  =  Cx  +  e,n/ 


where  e,  is  an  m  x  1  unit  vector  corresponding  to  a  failure  in  the 
/th  sensor.  For  sensor  failures  the  error  system  becomes 


-G(-d,n„  e  =  Ce  +e,n, 


where  d,  is  the  /th  column  of  the  detection  filter  gain  matrix.  The 
presence  of  d,  in  (8)  is  a  potential  difficulty  since  the  detection 
gain  is  not  known  a  priori.  The  objective  of  the  design  procedure 
for  a  sensor  failure  is  to  determine  two  a  priori  directions 
associated  with  a  failure  in  the  /th  sensor  such  that  the  output 
errors  lie  somewhere  in  the  plane  defined  by  Cd,  and  e,. 
Therefore,  the  closed-loop  error  system  of  (8)  can  be  replaced  by 
a  system  of  the  form 


Ge  -/>.+/. 


where  /  is  any  direction  such  that  e,  k  Cf,  and  f*  =  A  f.  It  is 
shown  in  Section  III-D  that  CA  f  lies  in  the  plane  generated  by 
Cd,  and  Cf. 

The  error  system  of  (6)  is  used  in  the  remaining  analysis  since  it 
is  easily  generalized  to  sensor  failures  or  other  circumstances 
which  are  described  by  multiple  failure  directions. 


III.  Detection  Filter  Design 


In  this  section  the  algorithm  for  determining  the  detection  gain 
D  is  developed.  First,  the  notion  of  detectability  of  a  fault 
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The  development  of  the  theory  of  detection  filters  from  (he 
eigensystem  assignment  approach  begins  w  ith  a  definition  of  the 
basic  requirements  for  a  detection  filter.  The  definition  of  the 
detectability  of  a  failure  with  the  design  direction  /  is  given  by 
Beard  [3]  as  stated  below. 

Definition  I:  The  failure  associated  with  /  in  the  system 
described  by  (6)  is  detectable  if  there  exists  a  filter  gain  matrix  D 
such  that 

a)  e(t)  maintains  a  fixed  direction  in  the  output  space,  and 

b)  all  eigenvalues  of  G  can  be  arbitrarily  specified,  except  for 
the  constraint  on  the  conjugate  symmetry. 

Condition  a)  forces  the  filter  to  have  properties  such  that  the 
output  error  direction  t  can  be  associated  with  the  design  error 
direction /.  Condition  b)  is  imposed  so  that  the  filter  can  be  made 
stable,  and  also  so  that  the  response  time  of  the  filter  can  be 
adjusted.  If  condition  b)  is  satisfied,  the  detection  "filter''  can 
also  be  used  as  a  state  estimation  observer.  The  conjugate 
symmetry  constraint  will  restrict  the  closed-loop  eigenvectors 
associated  with  complex  conjugate  eigenvalues. 

B.  System  Qualifications 

The  assumptions  upon  which  the  analysis  of  Sections  III-C  to 
III-E  are  based  are 

1)  (A,  C)  is  an  observable  pair. 

2)  CF  a  [Cf,  •  • Cf]  is  rank  r. 

3)  r  =  m. 

4)  The  closed-loop  eigenvalues  of  G,\„  i  =  1,  2,  ■  ■  -,  n  are 
distinct. 

The  observability  restriction  is  required  for  the  usual  state 
estimation  reasons.  The  assumption  that  CF  be  rank  r  will  be 
referred  to  as  the  condition  of  output  separability  [3] .  The  output 
separability  condition  produces  a  relatively  uncomplicated  closed- 
loop  structure  (Lemma  3).  If  CA’f,  =  0  for y  =  0,  1,  •  •  ■ ,  6  -  I 
and  if  CAhf,  *  0.  then  all  of  the  Cf  *  0  assumptions  and 
procedures  can  still  be  used  if  /  is  replaced  every  where  by  A  '/ 
[3],  [4],  [6|.  Furthermore,  if  the  output  separability  assumption  is 
not  satisfied  because  two  failure  directions  of  interest  have 
identical  output  directions,  the  dynamics  of  the  system  may  still 
allow  for  the  detection  of  the  two  failure  directions  with  a  single 
detection  filter.  As  a  design  procedure,  one  or  both  of  the  original 
directions  can  be  replaced  with  A’f  for  some  j  >  0  such  that 
Assumption  2  is  satisfied.  The  assumption  is  made  m  Sections  Ul 
C  to  III-E  that  r  =  m  since  it  is  generally  desirable  to  identify  the 
maximum  number  of  failures  possible  with  a  single  detection 
filter.  The  r  <  m  case  will  be  considered,  however,  in  Section 
IV  The  addition  of  a  set  ot  nonoutput -separable  directions  to  / 
will  be  described  in  Section  VII.  Some  constraints  on  eigenvalue 
assignability  generally  must  be  accepted  to  arid  these  directions  to 
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the  original  set  of  r  output  separable  directions.  The  analysis  for 
the  case  where  the  eigenvalues  are  not  all  distinct  will  be  given  in 
Section  V. 

C.  Detectability  and  Closed-Loop  Eigenvector  Constraints 

The  following  analysis  imposes  the  requirements  of  Definition 
1  by  placing  constraints  on  the  eigenstructure  of  the  error  systems. 
The  distinct  closed-loop  eigenvalues  and  eigenvectors  vt  of  (4) 
are  determined  by 

(A,/-G)e,  =  0  (10) 

for  j  =  1,2,  •  • ,  n,  where  Ce;  *  0  since  the  system  is  assumed 
to  be  observable.  The  iy  are  independent  and  span  the  error 
variable  state  space.  The  design  failure  vector  can  therefore  be 
written  as  a  linear  combination  of  the  closed-loop  eigenvectors  as 

n. 

<n> 

/■i 

where  the  /  superscripts  denote  those  closed-loop  eigenvectors  and 
coefficients  which  combine  to  form  /,  and  n,  <  n  is  the  number 
of  nonzero  a' . 

An  important  restriction  on  the  eigenvectors  v‘  is  given  by  the 
following  theorem  which  is  proved  in  Appendix;A. 

Theorem  1:  Condition  a)  of  Definition  1  is  satisfied  if  and  only 
if  Cf  and  Cif  are  colinear  for  all  values  of  j  =  l,  2,  •  •  • ,  n,. 

Remark:  It  will  be  convenient  to  normalize  the  v‘  such  that  Cv' 
=  Cf,  for  all  j  =  1,  ■  •  n,.  1  1 

Since  Theorem  1  can  essentially  be  viewed  as  placing  con¬ 
straints  on  the  closed-loop  eigenvectors  to  achieve  a  unidirectional 
output  error  while  condition  b)  of  Definition  1  requires  that  the 
eigenvalues  be  arbitrarily  assignable,  the  detection  filter  problem 
becomes  that  of  solving  the  set  of  equations 

SIKH:]  » 

for  the  detection  gain  D.  where  w,  I  C/  =  Cv'  with  j  =  1,2, 

‘ 1  ’ ,  h,  >  n,  and  i  =  1 ,  2,  ■  •  •  m.  However,  the  number  of 
eigenvalues  and  eigenvectors  h,  to  be  assigned  to  the  Cf, 
constraints  remains  to  be  determined.  The  conditions  under  which 
a  solution  to  (12)  can  be  obtained  for  D  and  the  vfs  will  be  given 
in  Sections  III-D  and  III-E.  These  conditions  are  the  same  as  those 
required  by  Beard  for  the  solution  of  his  formulation  of  the 
problem.  Interestingly,  the  appropriate  results  are  easily  derived 
here  even  though  the  formulation  is  different  (i.e. ,  (13)  below 
differs  from  Beard's  form  of  that  equation). 

D.  Detection  Spaces 

The  calculation  of  the  detection  gain  and  h,  with  respect  to  a 
single  design  failure  direction  is  now  examined.  Those  equations 
of  (12)  which  correspond  to  the  nonzero  a'’s  in  (II)  can  be 
summed  to  obtain 

DCf  =  Af-  j  a'X'v'  (13) 

/  *  * 

where  the  remark  follow ing  Theorem  I  and  (II)  have  been  used  to 
show  that  1"'.  |  a '  =  I  for  all  a'  *  0  corresponding  to  /,.  The 
form  of  the  solution  of  (13)  can  be  used  to  obtain  certain 
information  useful  to  the  solution  of  (12). 

Lemma  I:  [f  O,  S,  and  Q  are  matrices  of  dimension  n  x  m,  m 
x  r,  and  n  x  r,  respectively,  where  n  >  m  2  rand  rank  (S)  = 
r,  then  the  general  solution  of  DS  -  Q,  is  given  by 

D  =  QS'  fDIZ-SS*)  (14) 

where  D  is  an  arbitrary  n  x  m  matrix  and  represents  the  freedom 
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left  in  D  after  satisfying  DS  =  Q ,  and  S*  k  (STS)~  'Sr  is  the 
Moore-Penrose  pseudo  left  inverse  [7], 

A  proof  of  this  lemma  is  given  in  [3],  [6],  The  reader  is  referred 
to  [8]  for  a  more  thorough  discussion  of  the  generalized  inverse 
problem. 

The  solution  of  (13)  is  given  by  Lemma  1  as 

£=(/!/,-£  a'X'i +  D,P(Cf)  (15) 

where  P(S)  k  I  -  SS'  and  5  =  Cf,.  Equation  (15)  cannot  be 
employed  to  directly  solve  for  D  since  the  summation  term  is 
unknown.  However,  once  D  has  been  chosen  so  as  to  satisfy  the 
detection  filter  constraint  due  to  /  in  (13),  a  new  system  can  be 
defined  with  a  filter  gain  of  D,.  This  new  system  has  the  same 
form  as  the  original  problem  and  is  useful  because  it  allows  the 
detection  filter  designer  to  determine  how  many  eigenvalues  can 
be  arbitrarily  specified  by  the  choice  of  D,,  and  the  number  of 
eigenvalues  associated  with  Cf,  in  (12).  The  new'  system  is 
determined  by  substituting  (15)  into  (4)  to  obtain 

A  -  DC  =  A,  -  D,C,  (16) 

A,  k  A-^Af-^  C'Vv^iCfVC  (17) 

c,  k  P(Cf)C  (18) 

and  is  characterized  by  the  following  lemma. 

Lemma  2:  If  A,,  C„  and  D,  are  real  matrices  of  dimension  n  x 
n,m  x  n,  and  n  x  m,  respectively,  the  number  of  eigenvalues  of 
(A,  -  D,C ,)  which  can  be  arbitrarily  specified  by  the  free  choice 
of  D,  is  equal  to  q,  i  rank  (Si,)  where 

M,  k  [( C, )  T,  (C,A,)r,  •••,  (C,A;-')T)r.  (19) 

The  remaining  v,  k  (n  -  q,)  eigenvalues  of  (A,  -  D,C,)  are 
equal  to  the  corresponding  eigenvalues  of  A,,  which  also  are  those 
eigenvalues  of  G  associated  with  /.  The  proof  is  given  in 
Appendix  B. 

Remark:  Observe  that  Lemma  2  is  not  written  in  an  imple- 
mentable  form  for  this  problem  because  (17)  contains  the  iV’s 
which  are  unknown.  However,  A,  in  (19)  can  be  replaced  by  K,‘ s 
A[I  -  f(CfyC\.  The  equivalence  between  A,  and  K,  can  be 
established  by  using  C,K*v‘  =  0  for*  =  0,  1,  •  •  •,  n  -  1  [6], 

Definition  2:  The  null  space  of  M,  is  the  detection  space  off. 

Definition  3:  The  dimension  of  the  detection  space  of  /  is 
defined  to  be  the  detection  order  v,  of  /,  where  v,  =  n  -  q,. 

Definition  4:  The  failure  vector/^  is  detection  equivalent  to /, 
if 

a)  every  detection  filter  for/  is  also  a  detection  filter  for /,  and 

b)  Cf  =  QCfj  (Assumption  2  of  Section  Ill-B  implies  that  Cf, 
*■  0  and  Cf  0)  where  3  is  any  nonzero  constant. 

The  detection  space  is  a  G-invariant  subspace  of  the  error 
variable  state  space  which  represents  that  pan  of  the  system 
affected  by  /  or  some  detection  equivalent  direction.  The 
invariance  property  is  clear  from  (13)  since  the  summation  term 
represents  some  vector  in  the  detection  space  of  f  and  the  other 
two  terms  can  be  combined  to  form  Cf.  This  invariance  property 
implies  that  the  controllable  space  of  /  with  respect  to  G,  W, 
given  in  (/l-l),  is  a  subspace  of  the  detection  space  of/,  since  W' 
is  the  smallest  G-invariant  subspace  containing/.  The  fact  that 
W,  is  generally  a  proper  subspace  of  the  detection  space  is  the 
result  of  the  maximum  rank  of  W,  being  constrained  to  be  n,  <  v,. 
This  constraint  on  the  rank  of  W,  can  be  observed  from  the 
substitution  of  (11)  and  (10)  into  W,. 

The  detection  space  of  f  contains/,  and  all  of  the  /  which  are 
detection  equivalent  to/,  since  CJ,  =  0  and  K,f  =  0  imply  that 
/  and  the  detection  equivalent //s  lie  in  the  null  space  of  M,. 


5% 


1F.EE  TRANSACTIONS  ON  AUTOMATIC  CONTROL.  VOL  AC  t:.  NO  7.  JULY  1417 


Lemma  2  and  Definition  3  imply  that  there  are  v,  eigenvalues 
whose  corresponding  eigenvectors  span  the  detection  space  of  f. 
Since  n,  eigenvectors  of  ( 1 1)  are  known  by  Theorem  1  to  lie  in  the 
detection  space,  then  v,  >  n,  2  1.  Therefore,  since  there  are  v, 
eigenvectors  which  satisfy  the  eollinear  constraint  of  Theorem  1 
for  all  detection  equivalent  f's,  then  h,  =  v ,  is  the  number  of 
eigenvalues  and  eigenvectors  to  be  assigned  to  the  Cf  constraint 
in  solving  the  algebraic  equation  (12). 

It  remains  to  show  that  all  n  eigenvalues  can  be  arbitrarily 
assigned  as  required  by  Definition  1. 

Definition  5:  The  vectors  in  the  set  Fare  mutually  detectable  if 
there  exists  a  D  which  satisfies  the  detectability  conditions  of 
Definition  1  for  all  f,  in  the  set  F. 

The  condition  for  the  set  F  to  be  mutually  detectable  is  given  in 
Theorem  2  for  the  case  when  r  =  m. 

Theorem  2:  If  the  set  of  vectors  in  Fare  output  separable,  then 
the  f's  in  Fare  mutually  detectable  if  and  only  if 

m 

2 

i- 1 

The  proof  is  given  in  Section  IV  where  the  r  <  m  case  is 
discussed.  Procedures  for  making  a  system  mutually  detectable 
when  the  condition  in  Theorem  2  fails  is  given  in  Section  V. 

E.  Solution  of  the  Algebraic  Equation  (12) 

The  following  theorem  states  the  conditions  under  which  (12) 
can  be  solved  for  the  detection  gain  D  and  the  closed-loop 
eigenvectors.  The  proof  is  given  because  it  is  constructive  in 
developing  an  algorithm  for  the  solution  to  (12). 

Theorem  3:  Given  the  system  qualifications  of  Section  III-B, 
then  the  condition  S"  ,  v,  =  n  implies  that  the  system  of  (12)  can 
be  solved  for  the  detection  gain  matrix  D  and  the  closed-loop 
eigenvectors,  if,  where  j  =  1,  2,  •  •  •,  v,. 

Proof:  By  Theorem  2  the  n  eigenvalues  required  in  (12)  can 
be  arbitrarily  specified  v,  at  a  time,  while  simultaneously  being 
associated  with  a  particular  Cf  constraint.  Therefore,  for  each  f, 
n  linear  combinations  of  the  elements  of  D  can  be  determined 
along  with  the  nv,  elements  of  id's.  To  determine  these  unknowns, 
there  are  nv,  eigenvector  equations  and  mv,  eigenvector  con¬ 
straints.  When  mv,  2  n,  then  those  equations  in  (12)  correspond¬ 
ing  to  f  can  be  used  to  completely  solve  for  the  corresponding  u‘'s 
and  the  associated  set  of  n  constraints  on  D.  This  is  possible 
because  there  are  an  equal  number  of  independent  equations  and 
unknowns.  However,  if  mv,  <  n,  then  u'  is  representable  as  a 
linear  combination  of  any  basis  for  the  /th  detection  space.  Since 
the  /th  detection  space  must  be  orthogonal  to  M,  of  (19),  a  basis 
set  for  the  detection  space  can  be  found  by  computing  the 
unobservable  subspace  of  the  (C„  K,)  system  [9J.  Hence,  v1  can 
be  written  as  v)  =  where  Q,  is  an  n  x  v,  matrix  whose 
columns  form  a  basis  for  the  detection  space  with  respect  to /,  and 
(3'  is  a  p,  X  1  vector  of  coefficients.  It  is  now  shown  that  enough 
independent  equations  will  exist  for  solution  of  the  elements  of  the 
v)  =  and  the  n  constraints  on  D.  The  number  of  equations  in 
(12)  remains  at  nv,  +  mv,  while  the  number  of  uuxnowns  has  been 
reduced  to  v2  +  n.  If  the  number  of  equations  can  be  shown  to  be 
greater  than  or  equal  to  the  number  of  unknowns,  it  will  be 
possible  to  solve  for  the  |3'  and  the  constraints  on  D.  It  will  now  be 
assumed  that  this  is  true  so  that  [(rt  +  m)v, |  >  [p2  +  n\  which 
will  be  rewritten  so  that  the  validity  of  the  inequality  is  clear. 
When  r  <  m,  then  n  >  p,.  This  implies  that  n  -  v,  +  c  where  c 
>  0.  Hence,  Jr2  +  (m  +  c)p,|  2  (p2  +  v,  +  c|,  and  then  \m  + 
c\v,  >  (p,  +  c).  Since  m  2  1  and  v,  >  1,  then  the  inequality  is 
valid.  Hence,  solution  of  (12)  is  possible  and  Theorem  3  is 
proven. 

Remark:  As  discussed  in  Section  II.  sensor  failures  are 
included  by  determining  a  priori  fault  directions  f  and  /*  such 
that  Cf,  =  e,,  and  Cf*  lie  in  the  plane  composed  of  Cf,  and  Cd, 
Note  that  for/*  U  A  f  -  ij/,  where /  lies  in  the  detection  space 


of/  and  rj  is  an  arbitrary  constant.  Cf*  lies  in  the  plane  composed 
of  CDCf  =  Cd,  =  CV1  /  -  ,  a'k'e,  and  e,  where  (13)  is 

used.  '  ' 

IV.  r  <  m  Failure  Directions 

The  assumptions  of  Section  III-B  are  relaxed  to  allow  r  <  m 
The  condition  for  mutual  detectability  of  Theorem  2  must  be 
generalized.  This  is  done  by  determining  the  "detection  space" 
associated  with  the  set  F  =  {/,  •••,/,}.  In  a  manner  analogous 
to  that  for  the  single  failure,  the  detection  gain  for  multiple 
failures  must  simultaneously  satisfy  r  equations  of  the  form  of 
(13).  This  set  of  equations  can  be  written  in  matrix  form  as  DCF 
=  Qd  where  the  columns  of  Qd  are  the  right-hand  sides  of  (13)  for 
i  =  1,  2,  •  ■  • ,  r.  Lemma  1  gives  the  solution  of  DCF  =  Qd  as 

D  =  Qd(CFV  +DPICF)  (20) 

where 

A-DC=A-DC,  A  k  A-Qd(CF)'C  (21) 

C  k  P(CF)C ,  K  k  A  | /-  F(CF) * CJ.  (22) 

As  per  Lemma  2  and  the  following  remark,  the  observability 
matrix  with  respect  to  the  system  (A,  C)  is 

A/  =  |(C) r,  (Cd)7-,  •••,  (CA"-')T \T.  (23) 

The  number  of  eigenvalues  which  are  freely  assignable  by  D  is 
given  by  Lemma  2  to  be  rank  (A/)  U  q .  The  number  of 
eigenvalues  associated  with  making  D  a  detection  gain  for  the  set 
of  r  failure  vectors  in  F  is  therefore  given  by  v  =  n  -  q. 

Definition  6:  The  dimension  of  the  null  space  of  M\sv  =  n- 
q  and  is  defined  to  be  the  group  detection  order  of  the  set  F  = 
{/„  •••./,»• 

For  F defined  by  a  set  of  r  <  m  failure  directions  Theorem  2 
generalizes  to 

Theorem  4:  The  f's  in  Fare  mutually  detectable  if  and  only  if 

r 

p= 2  "<•  <24> 

i- 1 

Proof  is  given  in  Appendix  C.  Note  that  if  m  =  r,  then  CF  is 
invertible  and  C  =  0  in  (22).  Therefore,  v  =  n,  and  Theorem  4 
implies  Theorem  2. 

The  results  of  Theorem  3  must  be  slightly  modified  since  q 
eigenvalues  and  eigenvectors  remain  to  be  assigned  after  the 
detection  filter  has  been  designed  for  the/,  i  =  1,  2,  •  •  • ,  r.  q  is 
the  rank  of  M,  which  is  defined  by  (23).  These  q  eigenvalues  and 
eigenvectors  are  freely  assignable,  provided  that  the  eigenvectors 
are  independent  of  those  which  span  the  r  detection  spaces.  All  of 
the  solution  techniques  of  Theorem  3  are  also  applicable  for  the 
case  of  r  <  m,  since  the  developed  techniques  allow  for  the 
independent  solution  of  the  subset  of  equations  in  ( 12)  correspond¬ 
ing  to  a  single  /,  for  the  appropriate  v,  eigenvectors  and  n 
constraints  on  D  [6], 

V.  Detection  Filters  with  Nondistinct  Eigenvalues 

The  assumptions  of  Section  III-B  are  also  imposed  on  the 
analysis  of  (his  section,  except  that  nondistinct  eigenvalues  and  r 
<  m  are  now  allowed. 

Some  interesting  eigenstructure  constraints  are  imposed  by 
output  separability  when  nondistinct  eigenvalues  are  allowed. 
First,  it  is  demonstrated  that  the  detection  spaces  are  independent. 

Lemma  3:  If  the  failure  vectors  in  Fare  output  separable,  then 
the  detection  spaces  of  the  f’s  are  pairwise  independent.  The 
proof  is  given  in  Appendix  D. 

The  proof  shows  that  there  are  no  eigenvectors  which  span 
some  overlap  in  the  detection  spaces.  The  implication  of  this 
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lemma  on  ihe  Jordan  form  structure  of  G  is  that  any  Jordan  block 
must  be  completely  contained  in  the  v, -dimensional  part  of  the 
Jordan  structure  corresponding  to  the  /th  detection  space,  even  if 
the  algebraic  multiplicity  of  the  eigenvalue  is  greater  than  v,.  This 
requires  that  whenever  identical  eigenvalues  are  to  be  assigned  to 
A:  detection  spaces,  the  eigenvalue  must  be  assigned  a  geometric 
multiplicity  of  k  to  preserve  the  independence  of  the  detection 
spaces  This  can  be  illustrated  quite  simply  by  falsely  assuming 
that  ci  and  c;  lie  in  different  detection  spaces  when  v2  is  a 
generalized  eigenvector  of  c, .  By  the  definition  of  a  generalized 
eigenvector,  then  it  must  be  true  that  CGv2  -  \Cv2  -  Cu,  which 
must  be  equal  to  (a  is  a  constant  scalar  coefficient)  aC/2  =  Cfi  if 
i’:  and  i't  are  to  lie  in  their  respective  detection  spaces.  However, 
this  is  a  contradiction  bv  the  definition  of  output  separability. 

The  tact  that  the  geometric  multiplicity  must  be  exactly  k  can  be 
shown  by  assuming  falsely  that  two  primary  eigenvectors  within  a 
single  detection  space  have  the  same  eigenvalue.  If  these 
eigenvectors  are  denoted  by  c,  and  c;.  then  c,  -  v2  must  also  be 
an  eigenvector.  However,  this  implies  a  contradiction  of  the 
observability  assumption  since  C(t’,  -  v2)  =  0  by  Theorem  I. 
Hence,  ihere  can  only  be  one  primary  eigenvector  per  detection 
space  associated  with  a  particular  eigenvalue. 

It  the  system  ot  (12)  is  rewritten  to  include  generalized 
eigenvectors,  then  the  results  of  Sections  III-C  to  IH-E  can  be 
confirmed  to  be  valid  when  nondistinct  eigenvalues  are  allowed 
since  Theorem  I  can  be  proven  |6]  for  an/,  defined  in  terms  of 
primary  and  generalized  eigenvectors.  The  results  of  Sections  III- 
D  and  III  E  can  also  be  extended  to  the  nondistinct  eigenvalue 
case  by  inspection  if  the  summation  term  in  (13)  is  modified. 
Because  ot  Theorem  1.  this  modified  term  will  drop  out  of  an 
analysis  similar  to  that  used  to  obtain  M,  in  terms  of  K,  rather  than 
A,  (see  remark  after  Lemma  2).  and  the  results  of  those  sections 
carries  over  to  the  nondistinct  eigenvalue  problem. 

The  system  of  ( 12)  is  still  valid  for  those  values  of  j  associated 
with  primary  eigenvectors,  while  the  right  side  of  (12)  must  be 
replaced  bv  |-(e'  wr|r  for  those  values  of  j  corresponding 
to  generalized  eigenv  ectors.  The  results  of  Section  IV  are  trivially 
extended  to  the  nondistinct  eigenvalue  case. 

VI  Nonmutually  Detectable  Failures 

If  the  condition  of  (24)  is  not  satisfied,  then  a  solution  to  the 
detection  filter  problem  cannot  be  found  which  is  detectable  in  the 
sense  of  Definition  1  In  this  case,  (E'=|  v,)  +  q  <  n  and  there 
exists  an  excess  subspace  of  dimension  vr  such  that 


This  excess  subspace  exists  as  a  result  of  making  the  detection 
filter  respond  to  the  set  of  failure  directions  in  F ,  which  requires  v 
eigenvalues,  while  only  being  able  to  freely  assign  5/  p, 
eigenvalues  with  respect  to  the  individual  failure  directions. 
Hence,  v,  eigenvalues  will  be  fixed  by  the  choice  of  the  system 
and  the  set  F.  When  this  occurs  the  system  is  said  to  be 
nonmutually  detectable  ( 3 J  or  restrictive  (I0|. 

The  procedure  |6|  for  obtaining  a  mutually  detectable  system  by 
eliminating  some  of  the  fs  in  F  essentially  examines  the 
eigenvectors  which  span  the  excess  subspace  so  as  to  associate  the 
removal  of  an  /  with  the  elimination  of  some  of  those  excess 
eigenvectors  After  examining  each  of  the  /,  in  such  a  manner,  the 
dimension  of  the  excess  subspace  for  all  possible  combinations  of 
the  /,  can  be  simply  determined  Those  combinations  which 
produce  an  e  xcess  subspace  of  dimension  zero  are  those  subsets  of 
the  original  set  F  .vhich  permit  a  mutually  detectable  problem. 
Note  that  (24)  must  be  satisfied  when  r  =  I  because  M,  =  AT  and 
a  mutually  detectable  system  always  occurs.  The  removal  of 
failure  directions  from  the  original  set  of  r  directions  may  be 
averted  by  an  alternate  technique  which  was  first  suggested  by 
Jones  (4 1  This  technique  is  that  of  adding  dynamics  to  the  original 


open-loop  system  in  such  a  way  that  (24)  is  satisfied.  If  the 
original  open-loop  system  is  a  minimal  realization,  then  it  is  both 
observable  and  controllable  (with  respect  to  B).  Although  the  new 
open-loop  system  will  be  required  to  be  an  equivalent  realization 
of  the  original  system,  the  enlarged  system  will  be  observable  but 
not  controllable. 

The  motivation  for  enlarging-  the  state  space  to  obtain  a 
mutually  detectable  set  of  failures  requires  a  slightly  more  detailed 
understanding  of  the  structure  of  the  excess  subspace.  CV,,  where 
vr  is  any  one  of  the  eigenvectors  spanning  Ihe  excess  subspace, 
does  not  lie  in  any  one  detection  space.  However,  since  Cv,  *  0 
by  the  discussion  w  ith  respect  to  1 10)  and  is  directly  observable, 
then  CV ,  must  then  be  some  linear  combination  of  the  directions  of 
Cf  for  /  =  1,  2,  •  •  ■,  r.  Each  eigenvector  in  the  excess  subspace 
must  have  a  component  in  two  or  more  of  the  detection  spaces, 
otherwise  by  Definition  4  and  Theorem  I  the  eigenvector  would 
lie  in  and  span  some  detection  space.  The  removal  of  a  failure  J\ 
from  the  set  F  removes  those  eigenvectors  of  the  excess  subspace 
which  have  components  along  Cfk.  The  removal  of  some  other 
failure  direction  other  than  /,  may  also  remove  some  of  the  same 
eigenvectors  that  fk  would  remove.  The  essential  idea  of  state- 
space  enlargement  is  to  increase  the  dimension  of  the  state  space 
in  such  a  way  that  the  presence  or  absence  of  an  /.  in  /  does  not 
affect  the  dimension  of  the  excess  subspace  of  (he  enlarged  stale 
space.  This  will  be  accomplished  by  the  choice  of  a  new  open-loop 
system  matrix  such  that  the  Arth  detection  space  is  enlarged  by  an 
amount  equal  to  the  number  of  eigenvectors  ol  (he  original  excess 
subspace  which  have  components  along  C/, .  while  ensuring  that 
the  new  excess  subspace  eigenvectors  have  no  component  along 
Cft. 

Before  other  constraints  on  the  enlargement  of  the  state  space 
can  be  considered,  the  requirements  for  an  observable,  equivalent 
realization  {A,  B,  C)  with  a  dimension  of  h  >  n  must  be 
established  and  satisfied.  The  open-loop  models  of  (.4.  B,  C)  and 
{A,  B,  C)  are  defined  to  be  equivalent  realizations  (i.e.,  input- 
output  equivalent)  if  CA'B  =  CA'B  for  j  <  h  when  n  >  n.  One 
form  of  the  system  ( A ,  B,  C)  which  can  be  observable  and  is 
input-output  equivalent  for  any  h  >  n  is  given  by 


Let  vek  and  veK  represent  the  number  of  eigenvectors  of  the 
original  excess  space  which  have  output  components  that  lie  and 
do  not  lie,  respectively,  along  the  direction  C/t .  The  sum  of  these 
quantities  is  v,.  The  state  space  enlargement  approach  is  made 
possible  by  the  following  theorem  which  is  proved  in  Appendix  E. 

Theorem  5:  There  exists  an  observable  extension  of  (A,  C) 
into  (A,  C)  of  the  form  of  (26)  for  h  k  n  +  vrt  such  that  a)  Pek  = 
0,  b)  r*  =  r*  +  v,*,  c)  Pj  >  v ,  for  all  j  *  k,  and  d)  Pe  <  ve. 
Sufficient  conditions  for  a)  through  d)  to  occur  are  that  A,2  and 
A  22  be  chosen  as 

A 12  ^  \- -fk),  -«4V(X,V‘ “/»)!•  -4::  =  A,* 

(27) 

where  N  k  vrk,fk  k  summation  term  of  (13)  or  the  analogous 
nondistinct  eigenvalue  summation  term,  and  V,,  Ark  are  the 
eigenvalues  in  clement  and  matrix  (either  diagonal  or  Jordan 
form)  representations  that  correspond  to  the  eigenvector  vJ  and 
eigcnmatrix  Vfk,  respectively. 

Remark:  The  scalar  a1' s  of  (27)  may  be  calculated  as  o'  =  Ath 
row  of  (CF)'Cv'rk. 

A  sequential  application  of  the  Theorem  5  can  be  used  to 
generate  a  mutually  detectable  system.  Repeated  application  of 
those  results  with  respect  to  successive  failure  directions  in  F  will 
cause  each  of  those  directions  in  turn  to  be  eliminated  from  those 
eigenvectors  which  remain  to  span  the  excess  subspace.  Eventu¬ 
ally,  the  excess  subspace  will  be  eliminated  entirely  and  a 
mutually  detectable  system  will  have  been  formed. 
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Theorem  6:  The  nonmutually  detectable  system  of  (A,  C,  F) 
with  output  separable  /t’s  can  be  enlarged  into  an  observable, 
equivalent  realization  (Ar,  Cr,  F  =  [FT,  0]  0  which  is  mutually 
detectable  with  respect  to  the  same  set  of  failure  directions  (the 
/t’s)  with  the  minimal  state-space  dimension  of 

r 

hr  =  n+'£  vtk-vt  (28) 

k  «  I 

after  r  -  1  applications  of  Theorem  5. 

The  proof  is  given  in  Appendix  F, 

VII.  Output  Stationarity 

If  the  freedom  to  arbitrarily  choose  the  closed-loop  eigenvalues 
is  somewhat  restricted,  [4]  has  shown  that  more  than  m  failure 
directions  can  be  designed  into  the  detection  filter.  By  definition, 
these  additional  directions  are  not  output  separable  with  the 
original  set  of  directions  in  F.  If  hk  is  a  direction  to  be  added  to  the 
set  of  m  directions  in  F,  then 

m 

**  =  £«,/  +  **  (29) 

t*  I 

where  some  of  the  5,  may  be  zero  and  C£*  =  0.  If  hk  satisfies 
certain  conditions,  then  the  output  direction  associated  with  hk  can 
be  made  unidirectional  by  requiring  that  particular  subsets  of  the 
closed-loop  eigenvalues  take  on  identical  values.  In  the  terminol¬ 
ogy  of  [4]  this  is  referred  to  as  making  hk  output  stationary  with  F 
The  simplest  case  of  this  occurs  when  m  =  n  and  all  of  the 
eigenvalues  are  chosen  to  be  identical.  Under  these  conditions,  v, 
=  1  for  i  =  1,  2,  •  •  •,  n.  Since  •>,=  !,  then  u,  =  /,  and  the 
eigen  equations  become  Gf  =  X/,.  Here  hk  is  just  a  linear 
combination  of  the/’s  since  £*  4  0  is  required  for  C£k  =  0  to  be 
true.  Now  any  hk  can  be  detected  by  the  detection  filter  for  F since 
every  hk  will  also  be  an  eigenvector  for  G. 

A.  Output  Stationarity  for  a  Single  Additional  Failure 
Direction 


is  defined  to  be  of  some  dimension  vk ,  where  vk  <  min  (p,)  for  all 
i’s  corresponding  to  nonzero  6,’s  in  (29)  |6).  The  requirements  for 
output  stationarity  are  summarized  in  Theorem  7.  The  proof  is 
given  in  Appendix  G. 

Theorem  7:  For  a  mutually  detectable  problem  defined  by  A , 
C,  F,  it  is  possible  to  make  the  nonoutput-separable  direction  hk 
output  stationary  with  the  directions  in  F  if  and  only  if  two 
conditions  are  satisfied.  First,  £*  of  (29)  must  lie  in  the  union  of 
the  detection  spaces  of  the  /,’ s  which  correspond  to  nonzero  6, 
(assume  for  notational  simplicity  that  these  correspond  to  ;  =  1 , 
2,  •  •  ■ ,  /)•  The  second  condition  is  that  each  group  of  p, 
eigenvalues  must  contain  the  same  set  of  p*  arbitrarily  assignable 
eigenvalues  for  /  =  1,2,  ••-,/.  If  p*  <  p,  for  i  =  1,2 ,•••,/, 
then  p,  -  p*  unassignable  eigenvalues  will  exist  with  respect  to  the 
tth  detection  space. 

Remark:  Implementation  of  the  results  of  Theorem  7,  in  the 
spirit  of  the  previous  algorithms,  is  quite  straightforward.  The 
first  case  to  be  considered  is  when  pk  =  p,  for  all  i  =  1,2 ,•••,/. 
In  this  case  the  determination  of  the  detection  gain  matrix  and  the 
closed-loop  eigenvectors  proceeds  exactly  the  same  way  as  with 
(12),  except  that  the  eigenvalues  with  respect  to  the  detection 
spaces  of  the/’s  for  i  =  1,2 ,••■,/  must  be  chosen  as  identical 
sets  of  p*  eigenvalues.  The  eigenvalues  with  respect  to  the  other 
detection  spaces  can  be  freely  assigned. 

The  second  case  to  be  evaluated  is  when  i>k  <  p,  for  some  or  all 
of  the  f,  detection  spaces  for  /  =  1,  2,  •••,/.  In  this  case  the 
equations  of  (12)  must  be  solved  with  the  same  eigenvalue 
constraints  as  in  the  first  case.  However,  p,  -  vk  eigenvalues  are 
unassignable  for  each  of  the  detection  spaces  where  p,  >  vk. 
These  unassignable  eigenvalues  complicate  the  solution  of  (12) 
since  there  will  be  more  unknowns  than  equations.  This  complica¬ 
tion  is  eliminated  by  requiring  that  the  system  equations  of  (12)  be 
solved  simultaneously  with 

/  *, 

**  =  £  £  S'iur  S‘j  *  b'a',  +  l'r  (3°) 

I  «  I  J  -  1 

<3» 

J- 1 
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The  fully  measurable  case  is  a  powerful  motivation  for 
examining  the  general  conditions  under  which  additional  failure 
directions  can  be  detected.  This  section  will  determine  those 
conditions  which  must  be  satisfied  to  make  a  single  additional 
failure  direction  hk  output  stationary  with  the  failure  directions  of 
the  set  F. 

The  following  assumptions  are  made  for  the  analysis  to  follow: 
1)  ht  *  0,  2)  r  =  m,  3)  A,  C,  F  imply  mutual  detectability,  and 
4)  distinct  eigenvalues.  The  first  assumption  follows  directly  from 
the  previous  assumption  that  Cf,  *  0,  (29),  and  output  separabil¬ 
ity.  The  second  assumption  is  made  so  that  the  maximum  number 
of  output  separable  failures  will  be  designed  into  the  detection 
filter.  This  in  turn  assists  in  maximizing  the  number  of  hk’s  which 
can  be  made  output  stationary  with  the  directions  in  F.  Extension 
of  the  results  to  the  r  <  m  case  will  follow  trivially  from  the  r  = 
m  analysis.  The  third  assumption  requires  that  the  system  be 
mutually  detectable,  either  naturally  or  by  the  methods  of  Section 
VI.  This  assumption  implies  that  no  unassignable  eigenvalues 
exist  as  a  result  of  fixing  the  output  directions  in  CF.  This  allows 
for  the  maximum  possible  flexibility  in  determining  the  conditions 
for  output  stationarity.  Similarly,  distinct  eigenvalues  are  assumed 
because  this  aids  in  maximizing  the  freedom  allowed  in  specifying 
output  stationarity  criteria.  Complete  eigenvalue  assignability  is 
the  freedom  sacrificed  in  fixing  more  than  m  failure  directions. 

The  output  stationarity  problem  is  concerned  with  fixing  the  h„ 
detection  subspace  in  the  state  space  while  simultaneously 
maintaining  the  detection  subspaces  with  respect  to  the  directions 
in  F.  The  detection  space  for  some  direction  hk,  which  is  to  be 
made  output  stationary  with  the  output  separable  directions  in  F, 


along  with  the  condition  that  =  0  for  all  j  >  vk  and  /  =  1,2, 
•••,/.  The  6,  ’s  of  (30)  and  (3 1 )  are  known  a  priori  from  (29)  and 
y)  is  found  from  the  representation  £*  =  ,  y  V.  Equation 

(31)  requires  that  hk  satisfy  an  equation  of  tne  formVf  (13)  with 
the  same  detection  gain  matrix  as  that  used  to  fix  the  output 
directions  in  F.  Also,  all  of  these  equations  are  expressed  in  terms 
of  the  eigenvalues  and  eigenvectors  of  the  f  detection  spaces  and 
are  compatible  with  the  unknowns  of  (12). 

VIII.  Examples  of  Detection  Filter  Design 

Example  (a):  This  is  an  example  of  the  eigensystem  assignment 
methods  for  detection  filter  design  when  the  closed-loop  eigen¬ 
values  are  chosen  to  be  distinct,  r  =  m,  and  the  A ,  C,  F system  is 
mutually  detectable.  Let 

r°34i  To  i  oi  m  r  1 1 

1  2  3  ’  C=  0  0  1  ’/l=  ••/*--  1/2  . 
o  2  5j  Lu  u  J  |_  0 J  I  1/2 

Since  the  rank  of  CFis  r  =  2,  then  the  failure  directions  of/,  and 
/  are  output  separable.  The  test  for  mutual  detectability  using 
Lemma  1  and  the  following  remark  produces  M,  for  i  =  1,  2  as 


0  0  1  1  0 
005  ,  M2=  1/2 

0  0  25  7/2 


1/2  1/2 
7/2  5/2 

91/4  63/4 
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where  the  zero  or  redundant  rows  have  been  omitted.  The  rank  of 
M\  is  1  which  implies  that  p,  =  n-  l=  3-  I=2.  The  rank  of 
A/:  is  2  and  this  means  that  v2  -  n  -  2  =  1 .  Since  n  =  p,  +  v2, 
then  the  system  A,  C,  F  is  mutually  detectable.  The  closed-loop 
eigenvalues  are  now  assigned  to  the  respective  detection  spaces. 
Since  Pi  =  2,  the  choices  of  Xj  =  -2  and  XI  =  -  3  are  assigned 
to  the  detection  space  of /,.  Likewise,  the  choice  of  X;  =  -4  is 
made  for  the  detection  space  of  f2.  The  closed-loop  eigenvectors 
and  the  detection  gain,  D.  are  now  determined  from  ( 12).  The  two 
sets  of  linear  equations  obtained  for  X |  and  X;  can  be  solved  to 
obtain  du  =  9,  d2l  =  7,  d}i  =  2,  while  the  two  eigenvectors 
which  span  the  detection  space  of  f  are  calculated  to  be  d|  = 
(3  1  0)r,  cl  =  (2  1  0]r.  The  single  equation  of  (12)  for  Xj 
produces  the  detection  gain  elements  as  d,2  =  18,  d22  =  6,  and 
d32  =  9  where  the  eigenvector  is  known  to  be  f2  since  p2  =  1. 

Example  (b):  This  example  demonstrates  the  techniques  of 
Section  V  for  r  =  m.  nondistinct  eigenvalues,  and  a  mutually 
detectable  system.  This  problem  is  a  repetition  of  Example  (a) 
except  that  here  all  of  the  eigenvalues  are  chosen  to  be  identical. 
Because  of  the  detection  space  structure  of  this  example,  X  will 
have  a  geometric  multiplicity  of  2  but  an  algebraic  multiplicity  of 
3,  where  X  =  -  2.  The  system  of  (12)  is  used  for  X J,  while  for  XJ 
(12)  must  be  modified  to  accommodate  generalized  eigenvectors 
as  discussed  in  Section  V.  These  two  sets  of  linear  equations  can 
be  solved  to  produce  dt>  =  7,  d2 ,  =  6,  du  =  2,  and  t/j  = 
[3  1  0| r.  For  Xj  (12)  can  be  used  where  again  the  eigenvector  is 
constrained  to  be  f2  since  p2  =  1.  The  detection  gain  elements  are 
d\2  =  12,  d22  =  7,  and  d32  =  7. 

Example  (c):  This  example  deals  with  the  implementation  of 
the  detection  algorithms  of  Section  VI  for  the  restrictive  problem 


terms  of  those  eigenvectors.  As  mentioned  in  Section  VII-A,  the 
calculation  of  the  eigenvectors  and  detection  gain  is  complicated 
by  the  additional  equations  to  be  satisfied  when  making  some 
vector  output  stationary  with  the  directions  in  F  The  system  of 
(12)  can  be  employed  to  write  equations  with  respect  to  X  j  =  -  2, 
XJ  =  unassignable,  and  XJ  =  -2.  The  additional  equations  to  be 
utilized  for  the  output  stationarity  case  are  (30)  and  (31).  The 
additional  constraint  that  61  =  0  must  also  be  enforced  since  p,  > 
Pi.  All  of  these  equations  can  be  solved  to  obtain 

'-1  4]  [-2  2  1 

D=  2  3  ,  P=  11-1/2  (32) 

2  7  0  0  1/2 

with  6;  =  1,6;  =  0,  6J  =  2,  and  XJ  =  +2. 

The  reader  should  notice  that  the  unassignable  eigenvalue  here 
takes  on  the  same  value  as  the  excess  eigenvalue  of  Example  (c) 
where  h ,  and  f2  of  the  above  example  were  used  as  the  design 
failure  directions.  This  is  an  interesting  check  of  the  above  output 
stationarity  procedure,  since  any  effort  to  design  the  directions  of 
hi  and  f2  into  a  detection  filter  must  result  in  an  unassignable 
eigenvalue  with  a  value  of  +2.  The  comparison  between  this 
example  and  that  of  Example  (c)  may  be  further  enhanced  if 
Example  (c)  is  recalculated  with  XJ  =  -  2.  In  this  case,  the  gain 
matrix  and  excess  eigenvalue  are  identical  to  the  gain  matrix  and 
unassignable  eigenvalue  of  Example  (d).  Furthermore,  u,  of  the 
recalculated  version  of  Example  (c)  will  be  identical  to  of 
Example  (d),  which  is  associated  with  the  unassignable  eigenvalue 
XJ.  Therefore,  the  state  enlargement  technique  could  be  used  as  in 
Example  (d)  to  produce  a  mutually  detectable  system. 


where  distinct  eigenvalues  and  r  -  m  are  assumed.  The  system  of 
Example  (a)  is  used  again  here,  except  that  /  =  [0  0  1] T .  From 
Lemma  1  and  the  following  remark  p,  =  1.  This  problem  is, 
therefore,  restrictive  since  p  >  p,  +  v2  =  2.  The  dimension  of 
the  excess  subspace  is  n  =  3  -  2  =  1.  Since  only  two 
eigenvalues  are  arbitrarily  assignable,  X}  =  -2  and  XJ  =  -3 
with  the  excess  eigenvalue  left  to  be  determined.  The  two  systems 
of  equations  from  (12)  are  solved  for  the  detection  gain  matrix 
where  the  first  column  of  D  is  [  -  3  3  1J T  and  the  second  column 
of  D  is  (4  3  7]  r.  From  G  of  (4)  calculate  v ,  and  X,  from  (10)  as  X, 
=  2  and  u,  =  [12  4  l[r. 

The  results  of  Theorem  5  may  be  employed  to  enlarge  the  state 
space  where/i  =  Xjttj  =  -2(00  l]r.  The  enlargement  technique 
is  applied  with  respect  to  f  in  this  example,  although  f2  could  just 
have  easily  been  chosen.  Now  At2  of  Theorem  5  can  be  written  as 
-4,2  =  -5[2/,  +  2/,]  =  [0  0  - 20] T  while  A 12  =  X,  =  +2. 
This  choice  of  the  new  open-loop  system  must  enlarge  the 
dimension  of  the  detection  space  of  f  by  b)  of  Theorem  5,  and 
must  also  enlarge  the  detection  space  of  f2  by  Theorem  6.  This 
problem  is  an  example  of  the  case  where  p,  <  p,.  The  dimension 
of  both  detection  spaces  will  be  enlarged  by  one  and,  hence,  the 
new  system  will  be  mutually  detectable. 

Example  (d):  This  example  will  demonstrate  the  techniques  of 
Section  VII.  The  objective  here  will  be  to  make  a  single  additional 
nonseparable  direction  h,  =  [0  0  l]r  output  stationary  with  the 
failure  directions  of  Example  (a).  By  definition  the  direction  h , 
can  be  written  in  the  form  of  (29)  as  h,  =  f,  +  2 f2  +  £,  where  6( 
y  1 . 62  =2,and£i  ^[100]r.  Because  the  h3  direction  coincides 
with  the  /,  direction  of  Example  (c),  the  calculation  of  p,  here  is 
unnecessary  since  the  calculation  of  P|  in  Example  (c)  implies  that 
pi  =  1 .  From  Example  (a)  it  should  be  recalled  that  p,  =  2  and  v2 
=  1.  Hence,  the  application  of  Theorem  7  will  require  that  only 
one  freely  assignable  eigenvalue  be  assigned  to  the  detection 
spaces  of  /,  and  f2,  and  it  must  be  identical  for  both  detection 
spaces.  Because  P|  >  p,  there  will  be  one  unassignable  eigenvalue 
with  respect  to  the  detection  space  of  /,.  The  other  condition  of 
Theorem  7  which  must  be  satisfied  is  that  £i  lie  in  the  union  of  the 
detection  spaces  of /,  and  f2.  The  requirement  is  clearly  satisfied 
since  the  eigenvectors  which  span  the  detection  spaces  of /,  and  f2 
will  also  span  the  state  space  and,  therefore,  can  be  written  in 


VIII.  Conclusions 

A  derivation  of  the  detection  filter  theory  from  an  eigensystem 
assignment  approach  has  been  presented.  The  motivations  for  and 
the  development  of  this  theory  are  easily  accomplished  by  this 
approach.  The  analysis  results  in  a  set  of  simultaneous  equations 
to  be  solved  for  the  detection  filter  gains  and  the  closed-loop 
eigenvectors.  Necessary  and  sufficient  conditions  for  the  solution 
of  this  system  of  equations  have  been  given.  An  example  is 
presented  which  illustrates  and  integrates  all  of  the  theory  and 
associated  algorithms. 

Appendix  A 
Proof  of  Theorem  1 

Theorem  1  has  a  two-pan  proof  which  is  given  below  in  the 
form  of  two  lemmas  and  their  respective  proofs. 

Lemma:  Condition  a)  of  Definition  1  is  satisfied  if  and  only  if 
rank  (OP,)  =  1. 

Proof:  This  proof  is  due  to  Beard  [3],  The  controllable  space 
of  f,  with  respect  to  G  is  spanned  by  the  columns  of  the 
controllability  matrix  IP,,  where  IP,  is  defined  as 

IP  =  [/,.  Gf„  •••,  Gn~'f).  (A.  1) 

Sufficiency  of  the  lemma  can  be  established  by  noting  that  e(t) 
must  lie  in  the  range  space  of  IP,.  Therefore.  e(/)  can  be  written  as 
a  linear  combination  of  the  columns  of  IP,  as  «(r)  =  !P,g(r), 
where  g{t)  is  an  n  x  1  vector  of  coefficients.  The  output  error  can 
now  be  written  as  {'(/)  =  Ct(t)  =  CfP,g(r).  It  is  sufficient  that 
CIP,  be  of  rank  I  to  ensure  that  e(t)  is  unidirectional  for  any  g(t). 
The  necessity  of  the  lemma  can  be  shown  by  observing  that  e{t) 
can  be  driven  by  ft, ft)  to  any  state  in  the  controllable  space  off 
with  respect  to  G.  Hence,  condition  a)  of  Definition  1  is 
guaranteed  for  arbitrary  p,(t)  only  if  rank  CIP,  =  1.  □ 

Lemma:  Rank  (CIP,)  =  1  if  and  only  if  Cf  and  Cv'  are 
collinear  for  ally  =  1,  2,  •  •  -,  nt. 

Proof:  The  /  superscripts  on  the  vfs  and  the  corresponding 
XJ’s  and  aj's  have  been  suppressed  in  this  proof  for  the  sake  of 
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notational  simplicity.  Sufficiency  can  be  demonstrated  by  replac¬ 
ing  the  /,  in  CH',  with  (11),  and  then  (10)  can  be  employed  to 
obtain 

CW',=  £  ctj Ci'j ,  Y,  «j\,Cu„  •,  Yt  ayX""'C'uy  .  (A. 2) 

L  J  J  J  J 

Now  if  Cvj  and  Cf,  are  collinear  for  all  j  such  that  ay  *  0,  then 
Of',  becomes 

Clf',  =  (a|C/,a;C/,  •••,<*„_!<?/,]  (A. 3) 

where  the  a*  represent  the  coefficients  of  Cf,  after  the  summa¬ 
tions  have  been  executed.  Clearly,  the  rank  of  CW,  is  one  and 
sufficiency  has  been  proved. 

Necessity  can  be  proven  if  it  can  be  shown  that  i(t)  is 
unidirectional  only  when  the  conditions  of  the  lemma  are  satisfied. 
The  solution  to  (6)  can  be  written  as 

f (/)  =  eGlf (0)  +  f  eC{,'r)f^,(T)  dr.  (A.4) 

Jo 

The  transient  in  (A.4)  is  considered  to  be  zero,  by  assuming  that 
either  f  (0)  =  0  or  that  the  transient  due  to  the  initial  condition  has 
settled  out.  (This  requires  that  G  be  chosen  to  be  stable.)  Then, 
the  substitution  of  (11)  into  (A.4)  and  the  assumption  that  n,  is 
constant  (any  conditions  obtained  under  this  assumption  must  also 
apply  for  an  arbitrary  n,(t)  since  fi,  =  constant  is  still  a  possible 
failure  mode)  gives 

ni 

li‘a‘  |  ecu~n  drvj.  (A. 5) 

j- 1  0 

If  (A.5)  is  premultiplied  by  C.  exp  [G(r  -  r))u,  =  exp  [A,(r  - 
r)]uy  is  substituted  into  (A.5),  and  the  integration  of  (A.5)  is 
performed,  then  the  output  error  becomes 

£  (^)[l-<?V]Cvy.  (A. 6) 

By  (A. 6)  the  conditions  of  the  lemma  are  necessary  for  e(()  to 
have  a  fixed  direction  and,  therefore,  for  CW,  to  be  rank  1.  □ 
Theorem  1  combines  the  two  lemmas  of  this  Appendix. 

Appendix  B 
Proof  of  Lemma  2 

The  definitions  of  q,  and  p,  in  Lemma  2  imply  that  the 
observability  matrix  with  respect  to  (C„  A,)  is  of  rank  q,  while  the 
null  space  or  unobservable  space  is  of  dimension  v,.  Hence,  q, 
closed-loop  eigenvectors  span  A/,,  while  the  remaining  v,  closed- 
loop  eigenvectors  span  the  null  space  of  M,.  These  v,  eigenvectors 
must  by  definition  satisfy  the  condition  of  C,v‘  =  0  for  all  j  =  I, 
2,  •  •  •,  vt.  This  condition  implies  that  these  eigenvectors  must  be 
eigenvectors  of  A,.  This  can  be  observed  by  postmultiplying  (16) 


by  if  for  all  j  =  1 ,  2,  •  •  • ,  v,  to  obtain 

A, t/  =  (<4  -DC)u'j  +  D,C,v'j. 

(B.l) 

This  equation  reduces  to 

A,v't  =  +  D,C,v't 

(B.2) 

by  (10).  The  last  term  in  (B.2)  is  zero  for  all  of  the  eigenvectors  in 
the  null  space  of  M,  and.  therefore,  they  must  all  be  eigenvectors 
of  A,.  Hence,  v,  eigenvalues  and  eigenvectors  of  A,  are  specified 
by  the  solution  of  (13)  (i  e . ,  with  respect  to  f).  These  eigenvalues 
and  eigenvectors  cannot  be  affected  by  the  choice  of  D,  since  the 
last  term  in  (B.2)  is  always  zero.  However,  the  remaining  q , 


eigenvalues  and  eigenvectors  are  not  constrained  by  the  solution 
of  ( 13)  and  are  freely  assignable  by  the  choice  of  I),  The  proot  of 
Lemma  2  is  now  complete.  □ 

Appendix  C 
Proof  of  Theorem  4 

This  proof  is  due  to  Beard  [3],  Lemma  2  implies  that  there  are  q 
eigenvalues  that  are  freely  assignable  by  D  in  (15)  after  D  has 
been  constrained  to  be  a  detection  filter  for  all  the  f  in  F.  Output 
separability  implies  that  the  eigenvalues  associated  with  the 
detection  space  of  each  of  the  f  can  be  specified  independently  of 
the  other  detection  spaces  (i.e.,  Lemma  3).  Lemma  2  implies 
from  (15)— ( 18)  that  v,  eigenvalues  are  associated  with  each 
detection  space.  Condition  b)  of  the  definition  of  detectability 
requires  that  n  eigenvalues  be  arbitrarily  assignable  when  D  is 
constrained  to  be  a  detection  gain  for  all  of  the  /  in  F.  Hence. 

£  v^j+q  =  n.  (C  l) 

Since  v  k  n  -  q,  then  Theorem  4  is  proven.  □ 

Appendix  D 
Proof  of  Lemma  3 


Appendix  E 
Proof  of  Theorem  5 

The  organization  of  this  proof  is  as  follows.  First,  b),  c),  and  d) 
will  be  shown  to  result  from  (27)  in  a  very  direct  and  simple  way. 
Then  it  will  be  shown  that  the  definitions  of  (27)  imply  a),  because 
the  new  excess  space  will  not  have  any  output  component  along 
the  direction  CJk  =  Cfk.  where /[  =  [/[,  0) r.  This  form  of/*  is 
required  by  the  way  in  which  a  failure  physically  enters  the 
problem. 

The  determination  of  the  detection  space  of  /*  with  respect  to 
the  system  (A,  (?)  is  analogous  to  the  procedure  of  Section  III-D. 
The  quantities 


<?*  =  [C*  0) 


■  *  L°  A»\ 


(E.l) 


are  easily  derived  for  the  enlarged  state  space  from  equations  of 


The  lemma  can  be  proven  if  it  can  be  shown  that  the 
eigenvectors  that  span  the  /th  detection  space  are  independent  of 
those  that  span  the  other  detection  spaces  for  i  =  1 , 2,  •  •  • ,  r.  The 
independence  of  these  sets  of  eigenvectors  implies  that  there  are 
no  eigenvectors  which  span  some  overlap  in  the  detection  spaces 
and,  hence,  that  the  detection  spaces  are  pairwise  independent. 

A  proof  by  contradiction  can  be  accomplished  by  assuming  that 
some  dependency  exists  among  the  sets  of  eigenvectors  that  span 
the  detection  spaces.  For  the  sake  of  notational  simplicity  it  is 
assumed  that  there  are  just  two  detection  spaces  and  that  the 
overlap  between  the  two  is  a  two-dimensional  subspace.  Further¬ 
more,  assume  that  u|  and  uj  are  the  two  eigenvectors  from  the  1'1- 
dimensional,  first  detection  space  which  span  the  overlap. 
Likewise,  assume  that  v*  and  u*  are  the  two  eigenvectors  from  the 
i^-dimensional,  second  detection  space  which  also  span  the 
overlap.  Since  both  sets  of  overlap  eigenvectors  span  the  same 
subspace,  then  it  must  be  possible  to  write  u,  and  ul  in  terms  of  u* 
and  iij.  For  instance,  u}  =  ui|V|  +  where  the  u*'s  are 
constant  scalar  coefficients.  Premultiplication  of  this  relationship 
by  C  (recall  that  CV  *■  0)  and  the  conditions  of  Theorem  1 
produce  Cf  =  (o>i  +  u>2 )Cf2,  which  is  a  contradiction  by  the 
assumption  of  output  separability.  These  arguments  are  easily 
extended  to  the  general  case  and  by  contradiction  Lemma  3  is 
true.  □ 
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the  form  of  ( 18)  and  ( 17)  with  (,4,  C)  replacing  (.4,  C)  and  where 
/t  replaces  /*.  Now  a  matrix  similar  to  (19)  can  be  written  and 
expanded  in  terms  of  (E.  1)  to  obtain 


C[.  ( Ck  Kk )  T.  ( CkK\)T , 

0,  (C*-*l|2)r,  (  C*  (  A-*  12  +/l|;/422))  t. 


(E.2) 


Because  the  definition  of  At2  in  (27)  makes  Al2  lie  in  the  null 
space  of  Afk.  the  second  column  of  terms  in  (E.2)  is  zero.  Hence, 
the  rank  of  Atk  is  equal  to  that  of  A/*.  This  implies  that  vk  =  n  - 
<7*  =  n  -  qk  =  n  +  uek  -  qk  =  vk  +  vtk  and  that  b)  is  true. 

A  matrix  similar  to  (23)  for  the  enlarged  system  can  also  be 
defined  for  At.  This  matrix  will  have  a  form  analogous  to  that  of 
(E.2)  and  similar  arguments  imply  that  v  =  v  +  vek.  This  fact  in 
conjunction  with  b)  and  the  form  of  A/,  for  j  t*  k  imply  that  c) 
and.  therefore,  d)  must  be  valid. 

The  final  step  is  to  show  that  a)  is  valid  because  the  excess 
subspace  has  no  components  that  lie  in  the  k th  detection  space. 
The  proof  requires  a  detailed  look  at  the  structure  of  the  excess 
subspace. 

Any  eigenvector,  vJf,  which  lies  in  the  original  excess  subspace 
can  be  represented  as  a  linear  combination  of  some  or  ali  of  the  r 
failure  directions  plus  some  vector,  sn  which  lies  in  the  null  space 
of  C.  Hence,  vJe  =  E'=|  o\f  +  s2  where  some  of  the  o\  may  be 
zero,  and  s,  cannot  be  zero  for  vJt  to  be  independent  of  the 
detection  spaces  eigenvectors.  Furthermore,  each  excess  eigen¬ 
vector  must  have  an  s,  vector  which  is  independent  of  the  other  sy 
vectors.  Therefore,  Vtk  is  represented  as 


"1 

where  the  s  for  j  =  1 ,  2,  •  •  • ,  vek  represent  different  subsets  of 
the  values  of  t  =  l,  2,  • • •,  r  where  i  =  k  is  by  definition  a 
member  of  each  subset.  Also,  each  s,  must  satisfy  CSj  =  0  for/  = 
1,2,  vek.  Equation  (10)  for  Vek  can  be  written  in  matrix  form 
as  A  Vrk  =  V'kAek  +  DCVek  which  can  be  rewritten  from  (E.3) 
and  (13)  as 


2  •••.2a;V,  +  S/v]  (E.3) 

/'  iN 

‘k  'k  J 


AKk=  Kk  A,k  +  [  £  a)(Af-f,),  af'M/,-/)]  . 

U  'S'  J 

(E.4) 

Since  the  $,  for  j  =  1,  2,  •••,  v(  must  be  independent,  then 
another  basis  for  the  excess  subspace  is  defined  by  S,  k  [j|,  •  •  •, 
■s*].  The  subset  of  vectors  Sek  in  Sr  which  originate  with  Vfk  must 
span  the  same  excess  subspace  with  respect  to  the  new  basis 
representation,  since  the  same  group  of  failure  directions  create 
that  part  of  the  excess  space.  The  existence  of  a  basis  representa¬ 
tion  for  the  excess  space  which  lies  in  the  null  space  of  C  is 
guaranteed  by  the  fact  that  the  /,  are  output  separable  and  only  r 
nonzero  output  directions  can  be  fixed  by  the  detection  spaces. 
Terms  from  the  left  and  right  sides  of  (E.4)  can  be  canceled  to 
form 


AS,k  =  StkArk+  r  £  a]  (Kf-lh  •.  2 

U  '?  J 

(E.5) 

which  is  now  in  terms  of  a  new  basis  for  the  excess  subspace  due 
fo/*. 

v,  =  v,  is  now  assumed.  This  assumption  means  that 
enlargement  of  the  state  space  does  not  change  the  dimension  of 
the  excess  subspace.  This  in  turn  implies  that  some  v,k- 
dimensional  basis,  S,v,  must  exist  in  the  ri -dimensional  state  space 
to  span  the  excess  subspace  that  had  previously  been  spanned  by 
Sfk  in  the  n -dimensional  state  space.  Such  a  basis  can  be  written  as 


Sck  =  [Srt,  l\T  where  /  is  a  vek  x  vek  identity  matrix  which  is 
chosen  arbitrarily.  If  it  can  be  shown  that  Sfk  is  independent  of  the 
presence  of  /*  by  the  choice  of  A |2  and  /122  of  (27),  then  a)  will 
have  been  proven. 

Equation  (E.5)  can  be  rewritten  so  as  to  be  consistent  with  the 
enlarged  state  space  dimension  as 


.  (E.6) 


The  assumed  form  (Theorem  5)  for  A  can  be  expanded  to  obtain 


L  0  J 

and  then  added  to  (E.6)  to  produce 
A  s,k  =  Sek  A,t  + 


(E.7) 


(E.8) 


where  the  index  k  has  now  been  eliminated  from  the  />  to  give  I1. 
Because  CStk _=  0,  the  detection  space  directions  of  interest  are 
given  by  ASek.  Since  the  right-hand  side  of  (E.8)  has  no 
component  in  the  £th  detection  space,  the  new  excess  space  will 
be  independent  of  the  presence  of  /* . 

The  case  where  vf  <  ve  can  be  handled  in  exactly  the  same 
manner  as  the  above  case.  The  only  difference  here  is  that  now 
one  or  more  of  the  vectors  in  n  X  vek  -dimensional  Stk  will  now  lie 
in  and  partially  span  an  enlarged  detection  space  with  respect  to 
one  or  more/,  for  j  *  k,  rather  than  all  of  the  vectors  spanning 
the  new  excess  space  as  in  the  previous  case. 

The  hypothesis  of  a)  in  Theorem  5  has  now  been  proven  and  the 
proof  of  Theorem  5  is  complete,  except  for  a  discussion  regarding 
the  observability  of  (A,  C).  The  enlarged  system  must  be 
observable  as  a  result  of  the  enlargement  construction  of  Theorem 
5.  This  can  be  verjfied  by  recalling  from  the  proof  of  part  b)  that 
the  null  space  of  A?  must  have  a  dimension  of  v  =  v  +  vtk,  and  is 
known  to  be  spanned  by  v  eigenvectors  which  either  span  one  of 
the  r  detection  subspaces  or  the  excess  subspace  that  exists  after 
the  enlargement  process.  If  any  one  of  these  eigenvectors  is 
multiplied  by  C,  then  the  constraint  of  Theorem  1  or  the  nature 
of  the  excess  space  [e  g.,  see  (E.3)]  requires  that  the  resulting 
vector  be  equal  to  one  of  the  C/  directions  for  i  =  1 ,  2,  ■  •  • ,  r,  or 
some  linear  combination  thereof.  Hence,  the  subspaces  which 
compose  the  null  space  of  A?  are  observable  by  the  construction  of 
Theorem  5,  since  Cf,  =  Cf  *  0  for  all  /  =  1,  2,  •  •  • ,  r.  If  r  < 
m ,  then  the  remaining  q  eigenvectors  can  be  freely  chosen, 
provided  that  they  are  independent  of  those  with  respect  to  the  r 
detection  subspaces  and  the  excess  subspace,  and  are  selected  to 
span  an  observable  subspace.  Hence,  the  enlarged  system  of  (A, 
C)  is  observable,  and  the  proof  of  Theorem  5  is  complete.  □ 


Appendix  F 
Proof  of  Theorem  6 

From  Theorem  5  i>k  =  vk  +  vrk  and  any  increase  in  the 
dimension  of  some  other  detection  space  must  come  as  a  result  of 
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an  equal  reduction  in  the  excess  space  dimension.  Any  increase  in 
the  dimension  of  r,  to  vt  must  come  as  the  result  ot  a 
corresponding  decrease  in  vrj  to  irr  Hence. 


for  all  j  st  k.  Since  the  total  increase  in  the  dimension  of  those 
detection  spaces  other  than  that  of  fk  must  equal  the  reduction  in 
the  excess  space  dimension,  then  it  must  be  true  that 


£  (  e,  -  *0 )  =  S  <  v'>  ~  >  =  "<■  -  *v 


where  j  *  k  is  a  constraint  on  the  summation  terms.  Equations 
similar  to  (F.l)  and  (F.2)  can  also  be  written  for  the  second 
and  successive  extensions,  although  some  of  the  terms  in  the 
summations  of  (F.2)  will  be  zero  since  some  directions  will  have 
been  removed  from  the  excess  subspace. 

For  notational  simplicity  it  is  assumed  here  that  r  =  3.  The 
state-space  dimension  must  be  enlarged  to  n,  =  n  +  r,i  +  r,2 
after  r  -  1  applications  of  Theorem  5  and  where  k  =  1,2,  •••on 
successive  applications.  The  reason  for  r  —  1  rather  than  r 
applications  of  Theorem  5  will  become  clear. 

The  first  state-space  enlargement  removes  the  components 
along/,  from  the  excess  space  and  (F.2)  can  be  employed  to  write 
r,  -  r,  =  (r,,  -  r,2)  +  (r,3  -  r,2).  Substitution  of  this  equation 
into  the  previous  equation  produces  n,  =  n  +  (S]a|  vrk)  -  v,  + 
(r,  -  r,j)  which  has  the  form  of  (28)  except  for  the  final  two 
terms.  These  two  terms  are  equal  and  cancel  each  other  out.  Since 
the  first  enlargement  removes  all  excess  space  components  along 
/,.  the  new  excess  space  of  dimension  r,  can  have  components 
only  along  f,  and  /,.  Since  each  excess  eigenvector  must  have  a 
component  in  two  or  more  detection  spaces  (otherwise  the 
eigenvector  would  lie  in  and  span  some  detection  space),  then  r, 
=  r,2  =  *V).  The  second  state-space  enlargement  will  remove/: 
components  from  the  excess  space  and  also  enlarges  the  /5 
detection  space.  Hence,  the  excess  space  will  be  eliminated  after  r 
-  1  applications  of  Theorem  5. 

The  above  analysis  may  be  repeated  for  an  arbitrary  r  and, 
therefore,  Theorem  6  is  proven.  □ 

Appendix  G 
Proof  of  Theorem  7 

The  conditions  under  which  output  stationarity  is  possible  are 
intimately  related  to  the  relationships  between  the  closed-loop 
eigenvector  sets  of  the  0*’ s  and  the  id's,  as  well  as  between  the 
closed-loop  eigenvalues  of  X*  and  A'.  The  X*’s  and  the  0*’s 
represent  the  closed-loop  eigenvalues  and  eigenvectors  of  the 
filter  designed  as  if  hk  was  one  of  the  original  output  separable 
failure  directions.  Because  of  the  assumption  that  lies  in  the 
union  of  the  detection  spaces  of  the  /  for  /  =  1 ,  2,  •  ■  • ,  /,  then  £* 
and  hk  can  be  written  as  in  the  remark  after  Theorem  7.  The 
implication  here  is  that  hk  or  any  detection  equivalent  direction 
can  be  written  as  a  unique  linear  combination  of  the  eigenvectors 
which  span  the  detection  spaces  of  the  /  for  i  =  1,2,  •••,/. 

The  detection  space  of  hk  also  has  a  fundamental  role  in  the 
development  of  the  output  stationaiity  conditions.  The  projection 
of  the  hk  detection  space  onto  each  and  every  detection  space  with 
respect  to  f  (or  i  =  1,2,  •  •  •  can  be  shown  to  be  of  dimension  0k 
[6|.  The  implications  of  this  are  that  there  are  only  r*  eigenvalues 
which  may  be  assigned  to  the  detection  space  of  hk.  Since  the 
detection  space  of  hk  has  r* -dimensional  projections  on  each  of  the 
/detection  spaces  of  the /,  of  (29)  which  comprise  hk,  it  may  be 
deduced  that  the  same  arbitrarily  assignable  set  of  vk  eigenvalues 
must  be  a  subset  of  the  r,  eigenvalues  in  each  of  the  1  detection 
spaces  since  vk  <  r,.  The  unassignable  nature  of  the  r,  -  f, 
eigenvalues  when  r'»  <  r,  will  become  apparent  later  in  the  proof. 

The  temporary  assumption  is  made  here  that  r,  =  r,  for  all  i  = 


1,2,  •••,/.  The  implication  of  equating  hk  written  in  terms  of  ak 
and  0*  from  the  analogous  form  of  (11)  with  (30)  is  that 

v‘  =  (5'i'y'  +  •  ■  •  +SV)/d‘.  (G  1 ) 

Hence,  the  structural  requirements  for  the  output  stationarity  of  hk 
are  that  the  closed-loop  eigenvectors  of  the  detection  spaces  of  the 
hk  and  the  /  for  /  =  1,  2,  •  ■  ■,  /  be  linearly  related  as  in  (G.l). 
Although  these  arguments  have  been  based  on  the  assumption  that 
r*  =  r,,  the  proof  to  follow  will  show  that  the  unassignable 
eigenvalues  that  occur  for  vk  <  r,  become  fixed  as  the  result  of 
preserving  the  relationship  of  (G.l)  to  fix  the  output  direction  of 
CV 

The  proof  will  show  in  the  spirit  of  Theorem  1  that  the 
hypothesized  conditions  are  necessary  and  sufficient  for  the  output 
direction  of  h ,  to  be  fixed  by  the  same  detection  gain  as  for  the 
directions  in  F. 

The  necessity  of  the  conditions  in  Theorem  7  can  be  shown  in  a 
manner  analogous  to  that  of  th»  proof  of  Theorem  I .  An  equation 
derived  like  that  of  (A. 5)  but  in  terms  of  hk  and  the  eigenvalues 
and  eigenvectors  w  ith  respect  to  its  detection  space  is 


where  Cu*  =  CVr,  for  a  fixed  output  direction  by  Theorem  I. 
Similarly,  an  output  error  equation  can  be  written  for  hk  in  terms 
of  the  eigenvalues  and  eigenvectors  of  the  detection  spaces  of  the 
/’s  [i.e. .  from  (30)1  for  i  =  1,2,  •  •  • ,  /  as 

+  S  S  (rr)  M  •  <G  3) 

,  =  |  J  r  ,k  .  |  \  /  /  J 


The  form  in  which  (G.2)  is  written  clearly  indicates  that  when  vk 
=  r,  and  A‘  =  X' ,  then  (G.l)  implies  that  (G.3)  and  (G.2)  are 
identical  ana  the  output  direction  of  Chk  is  fixed  if  the  directions 
C/,.  •  •  •,  Cf,  have  been  fixed  by  Theorem  1.  If  vk  <  r(,  the 
output  stationarity  of  hk  can  only  be  ensured  if  \*  =  for  j  -  l, 
2,  •  •  r*.  and  the  V's  of  the  second  term  in  (G.3)  are  chosen 

such  that  the  corresponding  S' 's  are  zero.  This  is  the  reason  why  r, 
-  vk  eigenvalues  must  be  unassignable  for  each  detection  space 
where  vk  <  r,.  The  necessity  of  the  Theorem  7  conditions  for 
output  stationarity  has  now  been  shown. 

The  sufficiency  of  the  conditions  in  Theorem  7  can  also  be 
show  n  in  a  manner  similar  to  that  of  the  proof  of  Theorem  1.  An 
equation  analogous  to  that  of  (A.  1)  in  terms  of  hk  becomes  W/  = 
\hk,  Ghk,  •  ■■,  G"  'hk J  where  Cll't,  must  be  of  rank  1  if  the 
output  direction  Chk  is  to  be  fixed.  An  equation  in  terms  of  the  ht 
eigenvalues  and  eigenvectors  can  be  written  which  is  analogous  to 
(A. 2)  as 


L  /  - 1  j  -  i 


i = i 


where  the  conditions  of  Theorem  I  would  imply  that  C Wh  is  of 
rank  1  since  ChK  -  CO*.  Similarly.  (30)  can  be  used  to  rewrite 
in  terms  of  the  eigenvalues  and  eigenvectors  of  the  detection 
spaces  of  the  /  for  /  =  1,2,  •  ■  ■ ,  /  as 


\ii  h:- it 

L  /  ii-i  /  -  i  ci 
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If  now  the  S]  =  0  for  all  j  >  vk  when  v,  >  vk,  and  the  vk  freeiy 
assignable  eigenvalues  are  chosen  such  that  V  =  X*.  then  (G.5) 
can  be  rewritten  as 


(10)  J.  S.  Meserole,  Jr.,  "Detection  filters  for  fault-tolerant  control  of 
turbofan  engines."  The  Charles  Stark  Draper  Laboratory,  Cambridge, 
MA,  Rep.  f-751,  June  1981. 


[>'»  !  'k  I  | 

S2‘X-2SW»;-- 

i  i-i  j - i  i-i  -i 


such  that  premultiplication  by  Cand  (G.l)  produce  (G.4).  Hence, 
C(Vk  can  be  made  rank  1  if  the  directions  Cft,  ■  •  ■ ,  Cf  have  been 
fixed  by  Theorem  1,  and  the  sufficiency  of  the  Theorem  7 
conditions  for  the  output  stationarity  of  hk  with  F  has  been 
proven.  □ 
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Modeling  of  Parameter  Variations  and  Asymptotic 

LQG  Synthesis 

MIN'JEA  TAHK.  member.  ihhh.  and  JASON  L.  SPEYER,  m  iaow,  ieee 


Abstract— Conventional  approaches  in  modern  robustness  anti  sensi- 
tivils  liteorx  are  not  adequate  for  the  problems  associated  with  parameter 
sanation  since  the  structure  of  parameter  variations  cannot  be  modeled 
properls  or  included  in  the  ssnlhesis  procedure.  A  new  modeling 
technique  is  proposed  to  handle  a  class  of  structured  plant  uncertainties  in 
a  direct  was.  I  he  ke>  is  to  treat  deterministic  parameter  variations  as  an 
internal  feedback  loop  so  that  the  structure  of  parameter  variations  is 
embedded  in  its  model.  An  asvmptotic  LQG  design  ssnlhesis  based  on 
this  modeling  method  is  also  presented.  An  important  relationship 
between  the  structure  of  plant  uncertainties  and  the  LQG  weighting 
matrices  is  obtained.  This  relationship  clearlv  specifies  the  kind  of 
parameter  variations  allowable  for  the  LQG/l.TR  method. 


I.  Introduction 

ONE  aspect  of  current  development  of  MEMO  (multiinput. 

muitioutput)  feedback  system  theory  has  been  concerned  with 
stability  robustness  and  sensitivity  to  plant  perturbations.  Impor¬ 
tant  developments  in  this  field  are  found  in  the  LQGLTR  (loop 
transfer  recovery  i  techniques  ( 1 1-(4|  and  //“-optimization  theory 
associated  with  robustness  and  sensitivity  [5]-[8|.  Although  these 
modern  techniques  are  useful  in  treating  unmodeled  dynamics  and 
stochastic  uncertainties  such  as  disturbances  and  sensor  noises, 
they  may  not  be  adequate  in  handling  structured  parameter 
variations.  In  their  recent  paper  [18|,  Shaked  and  Soroka  showed 
ihat  an  LQG  controller  designed  by  the  LQG/LTR  method  suffers 
from  a  stability  robustness  problem  due  to  a  small  parameter 
variation.  Since  an  LQG  LTR  controller  is  known  to  recover  the 
guaranteed  stability  margins  of  an  LQ  regulator  or  a  Kalman- 
Bucv  filter  [3],  their  result  implies  that  a  conventional  usage  of 
stability  margins  is  no  guarantee  against  a  disastrous  loss  of 
stability. 

The  existing  modeling  methods,  on  which  the  current  robust¬ 
ness  and  sensitivity  studies  are  based,  are  external  descriptions  of 
the  plant  uncertainties,  in  the  sense  that  plant  uncertainties  are 
modeled  at  the  exterior  of  the  plant  by  assigning  extra  blocks  at 
the  input,  at  the  output,  or  around  the  plant  as  feedback  or 
feedforward  loops  [1 1|.  In  practice,  these  modeling  methods  are 
not  convenient  at  all  in  handling  parameter  uncertainty.  Many 
difficulties  arise  from  the  fact  that  parameter  uncertainties  are 
usually  given  in  state-space  forms  while  the  conventional  uncer¬ 
tainty  models  are  based  on  transfer  function  descriptions.  In 
Section  II  we  discuss  these  drawbacks  in  some  detail  and  identify 
the  inadequacy  of  the  conventional  uncertainty  models  for 
parameter  uncertainty  as  a  source  of  robustness  problems. 

Apart  from  the  well-known  modern  synthesis  methods,  other 
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methods  have  been  proposed  which  address  the  robustness 
problem  of  parameter  uncertainty  within  the  framework  ot  state- 
space  representation.  A  matching  condition  is  crucial  in 
Lyapunov-function  approaches  [23 1 .  [25 1  so  that  the  class  of 
parameter  variations  to  he  considered  is  severely  limited,  although 
some  relaxation  was  obtained  in  1 24 1  The  matching  condition 
assumes  that  a  parameter  variation  of  the  state  matrix  .-I.  denoted 
as  AH.  is  spanned  by  the  input  matrix  B  or  the  output  matrix  C. 
The  special  characteristics  of  this  type  of  parameter  uncertainty 
will  be  discussed  later.  Another  approach  which  deals  xvith  a 
larger  class  of  parameter  variation  is  the  stochastic  modeling 
method  using  state-,  control-,  and  measurement -dependent  multi¬ 
plicative  noises  |26],  [ 27 1 .  This  method  leads  to  a  direct  synthesis 
which  requires  the  coupled  solution  of  two  Riccati  equations  and 
two  Lyapunov  equaiions.  However,  this  stochastic  modeling 
method  does  not  directly  address  the  robustness  problem  associ¬ 
ated  with  modeling  errors  such  as  parameter  variations.  Other 
synthesis  methods  related  to  parameter  uncertainty  are  found  in 
[28] .  [ 29] .  Most  of  these  state-space  methods  simply  describe  a 
parameter  variation  as  a  difference  between  the  state-space 
representation  of  the  nominal  system  (or.  model)  and  that  of  the 
perturbed  system  (or.  real  system).  Thereby,  one  objective  of  this 
paper  is  to  better  understand"  the  role  of  the  structure  of  parameter 
variations  in  the  development  of  robust  synthesis  techniques. 

This  paper  circumvents  some  of  the  difficulties  and  drawbacks 
of  existing  methods  by  using  a  modeling  method  which  is  able  to 
characterize  the  structure  of  parameter  variations  in  a  simple  way. 
In  this  method,  a  parameter  variation  is  represented  as  an 
equivalent  fictitious  feedback  loop  called  the  internal  feedback 
loopt IFL).  In  particular,  we  are  using  the  fact  that  a  feedback  and 
a  parameter  variation  are  indistinguishable  when  input-output 
relations  are  considered.  The  advantages  of  the  IFL  modeling 
method  over  the  existing  methods  are:  I)  it  is  simple;  2)  the 
associated  stability  criterion  has  no  restriction  on  the  closed-right- 
half-planc  (CRHP)  poles  and  zeros  as  in  other  methods  [  1 7] ;  3) 
the  structure  of  parameter  variations  is  maintained;  and  4)  several 
modern  design  methods  can  incorporate  the  IFL  model  directly. 

In  IFL  modeling  a  parameter  variation  AH  is  decomposed  into 
three  parts:  the  input,  output,  and  feedback  matrices.  This 
decomposition  is  called  the  input/output  (I/C)  decomposition. 
The  idea  of  the  IFL  representation  or  the  I/O  de  romposition  is  not 
new.  Recently,  various  authors  have  employed  this  idea  either 
implicitly  or  explicitly  in  order  to  study  parameter  uncertainty 
[14],  [20].  1 30] -[32 1 .  However,  Mita  and  Ngarnkajornvivat  [33] 
seem  to  be  the  first  to  use  the  concept  of  the  I/O  decomposition  to 
develop  a  synthesis  method,  which  was  generalized  later  by 
Shaked  [34|.  Their  studies  were  limited  to  state-feedback  prob¬ 
lems  and  the  major  concern  was  pole  sensitivity  rather  than 
robustness.  Section  II  briefly  discusses  the  IFL  modeling  tech¬ 
nique  which  transforms  a  perturbed  closed-system  into  a  two- 
input.  two-output  (TITO)  system.  The  idea  of  representing 
general  plant  uncertainties  as  a  feedback  loop  was  also  previously 
suggested  in  [12],  hut  parameter  uncertainty  was  not  explicitly 
treated. 

The  main  purpose  of  this  paper  is  to  propose  an  asymptotic 
LQG  design  synthesis  based  on  the  I/O  decomposition  of 
parameter  variation.  Section  III  shows  that,  by  selecting  proper 
weighting  matrices  for  the  Riccati  equations,  either  the  regulator 
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pan  or  filter  pan  ol  an  LQG  controller  can  he  made  asymptoti¬ 
cally  robust  to  a  given  parameter  variation.  In  fact,  there  evicts  an 
explicit  relationship  between  the  LQG  weighiins  matrices  and  the 
structure  of  the  parameter  variation.  The  stability  robustness  of 
the  LQG  control  system  is  then  determined  solely  bv  the  other 
nonasv mptotic  pan.  This  implies  that  an  observer  based  on  the 
Kalman  filter  can  be  designed  bv  selecting  suitable  covariance 
matrices,  in  order  to  recover  the  robustness  of  the  LQ  regulator 
with  respect  to  a  given  parameter  variation.  This  asymptotical 
procedure  generalizes  the  LQG  I.TR  design  to  a  larger  class  of 
parameter  variations.  This  paper  also  shows  that  the  class  of 
parameter  variations  which  can  be  safely  treated  by  the  LQG  LTR 
method  is  limned  by  the  structure  of  the  input  and  output  matrices. 
Some  numerical  results  based  on  [  1S|  are  also  given  in  Section  IV 
to  illustrate  the  advantage  of  this  asy  mptotic  LQG  design  synthesis 
over  the  conventional  LQG  LTR  technique  in  the  presence  of 
parameter  uncertainty . 

II.  Mow-ling  of  Parameter  Variations 

Two  important  classes  of  plant  uncertainties  are  unmodeled 
dynamics  tor  truncated  higher  order  dynamics)  and  parameter 
uncertainty.  Since  these  uncertainties  arise  from  the  inaccuracy  or 
incompleteness  of  mathematical  models,  they  are  often  called 
modeling  errors.  In  this  section,  we  are  concerned  with  the 
modeling  of  modeling  errors  rather  than  the  modeling  of  real 
plants. 

A.  Drawbacks  of  Conventional  M  deling  Techniques 

Let  G(s)  and  Gfsl  be  the  real  plant  and  its  reduced-order  model, 
respectively.  Suppose  that  exact  system  parameters  are  known  and 
the  only  plant  uncertainty  is  unmodeled  dynamics.  For  G(s)  to  be 
an  acceptable  model,  the  frequency  behavior  of  G(s)  should 
approximate  that  of  G(s)  in  a  reasonable  manner  over  a  certain 
frequency  interval  specified  by  the  designer.  Mathematically,  the 
modeling  error  due  to  unmodeled  dynamics  can  be  specified  in 
several  ways.  Two  common  models  of  modeling  errors  are 

EJs)  :=  G(s)-G(s ) 
which  is  additive,  and 

Em(s)  :=  G(s)  '[G(s)-G(s)I 

which  is  multiplicative.  In  practice,  the  exact  form  of  £„(s)  or 
£„(j)  is  neither  available  nor  necessary.  Instead,  norm  bounds  of 
these  error  models  are  usually  sufficient  for  analysis  and  design 
synthesis.  Usually,  unmodeled  dynamics  are  assumed  to  be 
dominant  in  the  high-frequency  range  and  the  norm  bounds  of 
Ej(s)  and  E^(s)  are  determined  in  rather  empirical  ways. 

Now  consider  the  parameter  variation  case.  Suppose  that  G(s) 
and  G(.y)  are  of  the  same  order  but  some  parameter  uncertainties 
are  present,  i.e.,  G(s )  =  G(s,  p)  and  G(s)  =  G(s,  p)  where  p  is 
the  nominal  parameter  vector  used  in  the  model  and  p  is  the  real 
parameter  vector.  Then,  the  error  models  £„(5)  and  Em(s)  become 

EJs)  :  =  G(s,p)-G(s,  p) 

Em (s )  :  =  G(s,  p) '  '[G(s,  p )  -  G(s,  p) |. 

Suppose  that  the  parameter  uncertainties  are  parameterized  by  r 
independent  variables  r  :  =  {<i, r:,  •■•,£,}.  i.e.,  p  -  p  =  /((>. 
This  parameterization  can  be  done  easily  with  the  state-space 
representations  if  the  model  and  the  real  plant  are  assumed  to  be 
given  as  (/I.  B.  C)  and  (A,  B,  C),  respectively.  Then,  the 
parameter  uncertainties.  4,4  =  A  -  A,  45  =  B  -  B.  and  4C 
=  C  -  C.  are  given  as  4/1  =  4/1(0,45  -  45(e).  and  4C  = 
4C(e)  However,  it  is  readily  observed  that  the  computation  of 
Efs,  <)  or  £„(s,  r l  is  not  an  easy  task.  Let 


where  each  norm  is  assumed  to  be  defined  appropriately  .  Whil 
the  degree  of  parameter  variation  is  directly  given  bv  6., 
relationship  between  o.  and  <Q.  or  between  6.  and  o,„  is  extremel 
complicated  even  tor  a  single  parameter  variation,  except  for  6, 
=  0  it  bj  -  0  (or.  6,  =  0  it  o„,  =  0).  Apart  from  its  complexit 
and  inconvenience,  the  use  of  conventional  error  models  fo 
parameter  variations  leads  to  a  loss  of  information  about  th 
magnitude  of  parameter  variation  let  alone  the  loss  of  its  structura 
information. 

The  inadequacy  of  the  conventional  frequency-domain  erro 
models  for  parameter  uncertainty  is  important  in  light  of  stabilit 
robustness  since  their  use  in  robustness  analysis  may  lead  to  a 
incorrect  conclusion  on  the  stability  robustness  of  a  svstem  being 
considered.  For  example,  the  stability  margins,  w  hich  are  closely 
related  to  the  multiplicative  error  model  E„(s).  are  not  useful  if  a 
small  parameter  variation  possibly  produces  a  very  large  gain 
variation  or  phase  variation.  For  this  case,  a  substantial  amount  of 
gain  margin  or  phase  margin  cannot  be  a  guarantee  for  good 
robustness.  A  good  example  for  this  situation  is  found  in  ( 18],  as 
discussed  in  Section  I. 

Another  drawback  of  the  conventional  methods  lies  in  the 
limitations  in  apply  ing  stability  criteria  based  on  the  conventional 
error  models.  The  basic  assumption  of  Lethomaki's  stability 
criteria  (10|.  (Ill.  which  is  the  basis  of  the  M1MO  stability 
margin  concept,  is  that  the  perturbed  plant  has  the  same  numbers 
of  poles  and  zeros  as  the  nominal  plant  in  the  CRHP.  This 
restriction  on  the  perturbed  plant  was  pointed  out  and  compared  to 
the  inverse-Nvquist-based  stability  criteria  [17],  which  also 
assumes  that  the  nominal  plant  and  the  perturbed  plant  share  the 
same  number  of  zeros.  Thus,  the  class  of  plant  uncertainties 
properly  described  by  any  of  the  conventional  error  models  is 
limited  by  this  requirement.  It  is  important  to  note  that  those 
constraints  on  the  perturbed  plant  result  in  a  certain  cla.-,s  of 
parameter  variations,  which  may  destabilize  the  system,  being 
excluded  from  consideration.  Therefore,  we  see  that  the  unstruc¬ 
tured  plant  uncertainties  considered  in  (I0|  arc  not  strict! v 
unstructured,  but  there  exists  a  definite  requirement  on  the 
structure  of  plant  uncertainty.  For  general  parameter  variations,  a 
small  parameter  variation  does  not  necessarily  induce  small  gain 
and  phase  variations  |i.e.,  a  small  £„(s)].  and  does  not  necessar 
ilv  keep  the  same  number  of  CRHP  poles  and  zeros. 

These  observations  lead  us  to  the  conclusion  that  the  convcn 
tional  modeling  methods  for  plant  uncertainties  and  the  associated 
stability  criteria  may  not  be  a  reliable  tool  when  parameter 
uncertainty  rather  than  unmodeled  dynamics  is  involved  in  control 
system  design. 

B.  Internal  Feedback  Modeling  of  Parameter  I'ariations 

By  the  I/O  decomposition,  a  parameter  variation  is  equivalently 
represented  as  an  interna!  feedback  loop,  and  then  the  perturbed 
plant  is  depicted  as  a  TITO  system  where  one  feedback  loop  is  the 
nominal  feedback  loop,  and  another  feedback  loop  is  for  the 
parameter  variation.  This  representation  of  parameter  variation  is 
attractive  in  many  ways:  1 )  there  is  no  restriction  on  the  number  of 
CRHP  poles  or  zeros  of  the  perturbed  plant:  2)  the  structure  of  the 
parameter  variation  is  easily  embedded  into  the  input  matrix  and 
output  matrix  of  the  IFL:  and  3)  the  magnitude  of  the  parameter 
variation  is  directly  described  by  the  magnitude  of  the  feedback 
gain  of  the  IFL. 

Consider  a  linear,  time-invariant  system  where 
,v  =  /l.v  +  Bu 


represents  a  nominal  system,  and 


x  =  ,4.v  *  Bu 


n.  r  .  kj  EJs,  of,  =«£„<*,  oil 


y  =  C.v 
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represents  its  perturbed  system.  The  vectors  .v  £  J{",  u  £  Ilm. 
and  v  £  liJ  denote  the  state.  the_  input,  and  the  output, 
respectively.  We  assume  that  A/4  :  =  A  -  A  *  0.  but  A5 :  =  B 
-5  =  0  and  AC  :  =  C  —  C  =  0,  i.e.,  only  the  state  matrix  is 
subject  to  variation.  It  will  be  shown  that  A5  or  AC  can  be 
embedded  by  an  approximation  procedure  into  a  AH  of  an 
augmented  system. 

Suppose  that  A.4  is  parameterized  as  a  function  of  r  variables  e 
=  {t i,  and  given,  as 

r 

AH(f)  =  V  S,c, 

i  =  i 

where  the  S,'s  are  constant  matrices.  By  decomposing  S,  as  S,  = 
we  can  rewrite  A.4  as 

A/4(0  =  -AfLU)N 

where  M  £  11" Kp  and  S  £  Hi*"  are  constant  matrices 
determined  by  M,' s  and  ;V,'s,  and  L(e)  £  lipxq  is  a  matrix 
function  off  =  {e ; ,  ■  •  • ,  t, } .  The  decomposition  described  above 
is  called  here  an  I/O  decomposition  of  A/4.  Note  that  the  I/O 
decomposition  is  not  unique.  To  avoid  problems  with  an  unneces¬ 
sarily  large  M  or  N.  we  assume  that  the  decomposition  is  such  that 
M  and  N  are  full  rank;  i.e.,  M  and  N  are  of  minimal  dimensions. 
This  nonuniqueness  is  not  important  in  stability  analysis  and 
design  synthesis,  as  shall  be  discussed  later. 

Given  an  I/O  decomposition  of  A/4(e),  the  perturbed  plant  can 
be  written  as 


n 

JL— H  g"  g>’  — 

r — 

1 - [uoW-J 


(a) 


<b) 


(c) 

Fig.  1.  TITO  system  representations  of  the  perturbed  system. 

conventional  methods,  are  also  examined.  This  stability  criterion 
is  also  independent  of  the  I/O  decomposition.  Suppose  that  — 
ML(t)N  is  an  I/O  decomposition  of  the  A/4.  Then 


x-Ax  +  Bu  +  Mw 


det  [/  +  aL(e)Gii(j<j)] 


y  =  Cx 


Z  =  Nx 


=  det  lI+aL(()N($-oBK(/  +  gntC)-'C<t>)M] 
=  det  j  /  +  aML(t)N(<i>  -  <*>BK(I  +  gn  A')"  'C<f>)] 


w=  -L(t)z 


=  det  [I  -  ct&A  (<i>  -  d>BK(I  +  guK)~  'Cd>)]. 


where  two  variables  z  and  w  are  introduced,  respectively,  as  an 
auxiliary  output  and  an  input  connected  to  the  internal  feedback 
loop  with  a  gain  Z-(e),  as  shown  in  Fig.  1(a)  where 


g 


and  <t>  =  (si  -  A)’1. 

Let  K(s)  be  a  compensator.  Then,  it  is  easy  to  see  that  the 
perturbed  closed-loop  system  shown  in  Fig.  1(b)  is  equivalent  to 
Fig.  1(c)  where 


£n 

?I2 

C<i>B 

C0A/1 

<?21 

822  . 

N<t>B 

N<t>M  \ 

1 
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1 _ 
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J  L  - 
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For  a  given  e 
that 


a  sufficient  condition  for  closed-loop  stability  is 


det  \I  +  aL(t)Gz:(jw)]*0 

for  all  a  £  [0,  I]  and  <a  £  It  [12],  [14).  When  only  parameter 
uncertainty  is  considered,  the  above  criteria  gives  not  only  input- 
output  stability  but  also  the  internal  stability  of  the  closed-loop 
system  ( 1 9 1 .  In  other  words,  the  stability  of  the  unobservable 
or  uncontrollable  modes,  which  cannot  be  studied  by  the 


Therefore,  the  lack  of  uniqueness  of  the  decomposition  does  not 
affect  this  stability  criterion. 

C.  Parameter  Variations  of  the  Matrices  B  and  C 

For  parameter  variations  in  5  and  C,  the  I/O  decompositions 
can  be  obtained  in  several  ways.  One  way  is  to  construct  the  state- 
space  representation  of  the  closed-loop  system  (zl^,  Bc,  Cr)  and 
(Ac,  Bc,  Cc),  and  to  obtain  an  I/O  decomposition  for  AAC  =  Ac 
-  Ac-  However,  this  method  requires  a  state-space  representa¬ 
tion  of  the  feedback  compensators.  Or  we  can  construct  aug¬ 
mented  state-space  representations  (Aa,  Ba,  Ca)  and  (Aa,  B„,  C„) 
where  B„  =  5„,  C„  =  C„  but  &A„  =  Aa  -  A„  approximates  A 5 
and  AC.  The  latter  is  more  favorable  since  a  unified  approach  is 
possible  for  all  kinds  of  parameter  variations  so  it  will  be 
discussed  in  detail  in  this  section.  Some  other  methods  for  A5  and 
AC  are  also  found  in  [20],  [36]. 

Suppose  that  the  /lh  column  of  5  is  subject  to  a  perturbation. 
Then  we  augment  the  state  vector  x  by  an  extra  state  xb  which 
follows  the  input  u,  fast  enough  so  that  its  dynamics  arc  negligible 
when  compared  to  other  modes  of  A.  Effectively,  w,  becomes  a 
state,  and  the  plant  is  approximated  by 

where  a  is  a  sufficiently  large  positive  number,  b,  is  the  /th 
column  of  the  matrix  5,  and 


50  =  5  -  ]  0  0  0] 

5o  =  (0  0  -  •  •  a  ■  ■  ■  0], 
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In  faci.  the  transfer  function  from  u,  to  xh  becomes 
a 

xb(s)  = - u,(s),  a  >  1. 

s  +  a 

Similarly,  for  the  y th  row  of  C,  which  is  subject  to  a 
perturbation,  we  can  augment  the  state  space  as 

UHs  -0 [;]♦[;]■ 

,.|C„  C  ['] 
where  r  >  1,  c;  is  the  jth  row  of  C,  and 

Co  =  C-[00  c]  0]r 
co  =  [0  0  •  •  •  r  •  ■  ■  0]7". 

This  augmentation  gives 

y,(s)  =  — —  xc(s),  t  >  1. 

S  +  T 

It  is  easy  to  prove  that  controllability  and  observability  are  not 
affected  by  the  above  state  augmentation  procedures.  The  poles 
and  zeros  of  the  original  system  are  also  those  of  the  augmented 
system,  and  the  only  alteration  is  the  addition  of  some  poles  on  the 
negative  real  axis,  which  are  well  beyond  the  bandwidth  of  the 
plant.  The  proofs  are  simple  and  left  to  the  reader.  Since  A B  and 
AC  can  be  approximated  as  AAa  of  an  augmented  system,  it  is 
assumed  from  now  on  that  theperturbed  plant  does  not  have 
perturbations  in  B  and  C,  i.e.,  B  =  B  and  C  -  C. 

III.  An  Asymptotic  LQG  Design  Synthesis 

This  section  introduces  an  asymptotic  LQG  design  procedure 
based  on  the  internal  feedback  modeling  method  described  in  the 
previous  section.  It  is  shown  that  the  finite  poles  of  either 
the  regulator  or  filter  of  an  LQG  control  system  can  be  Jesigned  to 
be  asymptotically  insensitive  to  a  specified  parameter  variation  via 
a  suitable  se'ection  of  LQG  weighting  matrices.  These  weighting 
matrices  turn  out  to  be  closely  related  to  the  structure  of  a 
particular  class  of  parameter  variations.  The  robustness  of  the 
LQG  control  system  is  essentially  determined  by  the  remaining 
wnsiuse  pan  ot  the  LQG  compensator.  In  other  words,  the 
romjstness  of  the  LQG  control  system  recovers  either  the 
ronponess  of  the  LQ  regulator  or  that  of  a  Kalman-Bucy  filter. 

1  he  robustness  problem  again  reduces  to  determining  the  robust- 

■  'i  citner  the  LQ  regulator  or  Kalman-Bucy  filter.  Fur'-er- 
•••■  f  the  1  QG  LTR  procedure  may  be  considered  a  special  case 
•  -  ,  .I, . mpto'ie  LQG  method,  although  the  explicit  connection 
nunt  uncertainties  and  the  LQG  weighting  matrices  had 

■  -  i.le 

■'  \  arialtons 

i  ■  strus tured  parameter  variations  has  great 
-  .  tv  t tie  jsvmptotic  behavior  of  LQG  poles. 

-  ■  i •  i.  ter:, tics  .ire  given  here. 

\f  c  £ 

t  -len  \f  i.  ni  he  column -similar  to 
M  .in. i  V  is  said  to  be  row-similar 

>  t,  i  matrix  P  such  that  A/ \  = 

,  \l  in.1  that  there  exists  a  matrix 

•>  ■  "iij.tr  to  V 


Definition  2  (Similar  Parameter  Variations):  Consider  a 
nominal  plant  (A,  B,  C )  and  its  penurbed  plant  (A,  B,  C)  whert 
A  A  =  A  —  A  =  —  A/LA'.  Then,  A/1  is  said  to  be  input-similai 
if  A /  is  column-similar  to  B,  and  it  is  said  to  be  output-similar  i| 
A  is  row-similar  to  C.  It  is  also  called  Similar  if  it  is  input-similar 
or  output-similar. 

An  important  characteristic  of  similar  variations  is  that  they  do 
not  perturb  the  zeros  of  the  plant.  Let  Z,  and  Zrf  be  the  sets  of 
transmission  zeros  and  decoupling  zeros  of  the  nominal  plant  (A, 
B,  C).  respectively.  Similarly,  we  define  Z,  and  Zd  as  the  sets  of 
transmission  zeros  and  decoupling  zeros  of  the  penurbed  plant 
(A,  B ,  C).  (We  will  follow  the  definition  of  zeros  given  in  1161 
[22].) 

Lemma  1:  If  A/1  is  similar  (i.e.,  input-similar  or  output- 
similar),  then 

z,  u  zd=z,  u  zd. 

Proof:  For  input-similar  variations. 

Therefore,  the  system  zeros  are  invariant.  The  proof  is  similar  for 
output-similar  variations.  □ 

B.  Asymptotic  Pole  Sensitivities 

Consider  an  output  feedback  system  with  a  constant  gain 
feedback  given  as 

x  =  Ax+  Bu 
y=Cx 
u  =  v~  kFy 

where  B  €  C  S  Slmxn,  F  E  are  assumed  to  be  full 

rank,  and  k  E  Jl.  The  numbers  of  the  input  and  output  are 
assumed  to  be  the  same. 

Lemma  2:  Suppose  that  G(s)  =  C(sl  -  tp'B  has  j,  finite 
transmission  zeros.  Then  j,  poles  of  the  closed-loop  system,  4  - 
kBFC,  asymptotically  approach  the  transmission  j,  zeros  of  G(s) 
as  k  oo. 

This  is  a  well-known  feedback  property  for  single-input,  single¬ 
output  (SISO)  systems.  Lemma  2  .mplies  that  the  number  of  the 
finite  eigenvalues  of  A  -  kBFC  as  k  -*  oo  js  equal  to  j,  +  jd 
where  jd  is  the  number  of  decoupling  zeros  corresponding  to 
unobservable  and/or  uncontrollable  modes  and  j,  is  the  number  of 
asymptotically  finite  closed-loop  poles.  For  the  MIMO  case,  its 
formal  proof  can  be  found  in  [21,  Theorem  4.3]. 

Theorem  1:  Consider  the  above  output  feedback  system. 
Suppose  that  the  state  matrix  A  is  perturbed  by  a  parameter 
variation  given  as  AA  =  -  A/LA'.  Then,  as  k  -*  oo,  every  finite 
eigenvalue  of  A  -  kBFC  is  asymptotically  insensitive  to  the 
parameter  variation  -  A/LA  if  AA  is  similar. 

Proof:  Consider  a  similar  variation  -/3A/LA  where  0  E 
[0,  1],  Let  PU 3,  k)  be  the  set  of  finite  eigenvalues  of  A  -  (3MLN 
-  kBFC  and  Z(0)  be  the  set  of  zeros  of  {A  -  0MLN,  B,  C). 
For  the  nominal  system  (0  =  0).  we  denote  them  as  /’(O,  k)  and 
Z(0),  respectively.  Lemma  2  implies  that  P(@,  k)  -*  Z(0)  and 
P(0,  k)  -*  Z( 0)  as  k  oo.  However,  fron.  Lemma  1.  Z(/3)  = 
Z(0).  Thus,  lim*-„  P(0,  k)  =  lim*-„  P( 0,  k).  Suppose  that  p  E 
P(0,  k)  and  p  E  P(0,  k)  approach  z  E  Z(0)  as  k  -*  oo.  Then, 
there  exists  a  constant  L(< 5)  for  any  6  >  0  such  that  |z  —  p\  <  5 
and  ;z  -  p\  <  5  for  all  k  >  k(5).  Therefore.  \p  —  p\  <  2b  for 
k  >  k(b).  The  continuity  of  eigenvalues  of  A  -  0MLN  - 
kBFC  with  respect  to  0  completes  the  proof.  □ 
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C.  Asymptotic  Robustness  of  a  Full  State  Regulator  with 
Observer 

Theorem  1  states  that  the  finite  poles  are  insensitive  to  a 
parameter  variation  when  AX  is  similar.  This  property  is  not 
readily  useful  since  the  closed-loop  stability  is  not  guaranteed  by 
simple  output  feedback.  However,  the  results  of  Theorem  1  give 
some  insight  into  the  asymptotic  robustness  of  observer-based 
control  systems.  Consider  a  full-state  feedback  regulator  with 
observer  (FSRO)  for  which  the  state  matrix  of  the  closed  loop 
system  is  given  by 

-  _  F  A  BKC  ] 

r  \_-KfC  A  -  BKC-  K/C  J 

where  A  =  A  -  MLN,  A  €  n B  £  |j C  £  M 
£  nn%p,  N  £  /}«*",  L  £  np*q,  and  Kc  £  Jimx"  and  Kf  £ 
Jlnxl  are  the  regulator  and  observer  gain,  respectively.  For  the 
rest  of  this  paper,  we  assume  that  B,  C,  Kf,  Kc,  M,  and  N  are  of 
full  rank  and  (/l,  B,  C)  is  minimal.  If  AX  =  0,  then  the  closed- 
loop  poles  are  determined  by  A  -  BKC  and  A  -  KfC.  The  choice 
of  Kc  only  affects  the  regulator  part  while  Kf  affects  only  the 
observer  part.  If  AX  =£  0  (i.e. .  A  =£  A),  then  the  perturbed 
closed-loop  poles  are  no  longer  determined  by  A  -  BKC  and  A  - 
K/C.  However,  the  following  theorem  shows  that  the  coupling 
between  the  regulator  and  the  filter  can  be  eliminated  asymptoti¬ 
cally. 

Theorem  2:  Consider  the  above  FSRO  system  whth  a  parameter 
variation  AX  =  -  MLN.  Suppose  that 

Kf=yK,+  K,0(y) 

where  K,  is  a  finite  matrix  and  (1  /y)K/0(y)  —  0  as  y  -*_oo. 
Suppose  that  M  is  column-similar  to  Kf.  Then,  as  y  -*  oo:  l )  A  - 
BKC  determines  half  of  the  closed-loop  poles;_  2)  there  are 
asymptotic  poles  approaching  the  zeros  of  (A,  Kf,  C);  and  3) 
these  asymptotic  poles  are  asymptotically  insensitive  to  the 
parameter  variation  AX  =  -MLN. 

Proof:  Since 


Lemma  2  implies  that,  as  y  —  oo,  the  finite  closed-loop  poles 
approach  the  zeros  of 


Then,  Pfs)  =  P|(s)  +  Pfs).  Suppose  that  z„  is  a  finite  zero  of 
Po(s).  Let  q(  ■ )  and  a(-)  denote  the  minimum  singular  value  and 
maximum  singular  value,  respectively.  Then,  a(Pt ,(;„))  =  0. 
From  the  singular-value  inequality  n(X)  -  6(B)  <  a(A  +  B)  < 
q(A)  +  6(B),  we  have 

q(PoCo))  ~  o(P:(Z0))  <  q(PfZ„))  sqlPfZo))  +  a(Pz(z0)). 

Then,  q(Pfz0))  -*  0  as  7  —  00  since  linx,_„  6(P:(Zv))  =  0. 
Therefore.  is  also  a  zero  of  Pfs)  in  the  limit  so  that  the  finite 
closed-loop  poles  are  determined  by  the  zeros  of  Pfs).  Let 

Hfs)  :  =  C0(s/-A0)-'Bt. 

Then,  a  simple  calculation  gives 

Hfs)  =  C(sI-A)'{  /  +  MLN  (si  -  X  +  BKc)-'}Kf. 

Since  M  is  column-similar  to  Kf.  there  exists  a  matrix  P  such  that 
M  =  KfP.  Then,  Hfs)  can  be  written  as  Hfs)  =  Hf(s)Hc(s) 
where 

Hf(s)  :=C(s/-A)-'Kf 
Hc(s)  :  =  l+PLN(s/-A+BKc)-‘Kf. 

Consider  Hc(s)  first.  Since 

det  [  Hc(s)\  =  det  [  /+  PLN(sl- A  +  BKC)~  'Kf 

=  det  [/+(sl- A +BKc)-'KfPLN) 

=  det  (si  -  A  +  BKC)  - 1  det  (si  -  A  +  BKC  +  KfPLN) 

=  dct  (si- A  +  BKC)~'  det  (si  -  A  +  BKC), 

the  zeros  of  Hfs)  are  the  eigenvalues  of  X  -  BKC.  This  shows 
that  all  the  nominal  regulator  poles,  which  are  determined  by  X  - 
BKC,  are  perturbed  to  the  eigenvalues  of  X  -  BKC  =  X  -  MLN 
-  BKC.  This  proves  pan  1).  For  the  observer  part.  Lemma  1 
implies  that  the  zeros  of  Hf(s)  =  C(sl  -  A )~  '/^are  the  zeros  of 
C(sl  -  A)-'Kf  since  M  is  input-similar  for  the  system  (X,  A), 
C).  Thus,  the  asymptotically  finite  poles  of  the  observer  part  of 
the  perturbed  system  go  to  the  zeros  of  C(s/  -  A)-'Kf,  which 
completes  the  proof  of  part  2).  Finally,  using  the  same  arguments 
as  used  in  Theorem  1.  we  can  easily  show  that  part  3)  is  tnie.D 

The  following  properties  are  direct  from  Theorem  2;  1)  the 
asymptotic  pole  locations  are  determined  by  the  dominant  part  of 
the  observer  gain,  Kf\  and  2)  the  regulator  poles  are  perturbed  in 
the  same  way  as  a  state-feedback  system  X  -  BKC  is  perturbed  by 
AX  when  AX  is  column-similar  to  Kf.  Since  the  asymptotic 
observer  poles  are  insensitive  to  AX  in  this  case,  we  may  suspect 
that  the  robustness  of  the  FSRO  controller  is  determined  by  its 
regulator  part.  This  is  true,  but  we  have  not  considered  the 
sensitivities  of  asymptotic  infinite  poles  yet.  This  robustness 
property  becomes  explicit  in  Theorem  3  to  follow.  We  first 
consider  the  next  lemma  for  its  proof. 

Lemma  3:  Suppose  that  A,  B,  C  are  matrices  such  that  A  +  B 
+  C  is  invertible.  Then, 

A  +  B(A  +B  +  C)  'C=(X  +  C)(X  +fl  +  C)  '(X  +  B). 

Proof:  Let  D  =  (X  +  B  +  C)  ',  then 


Define 


X  +  BDC  =  A  DD  1  +  BDC 


=  ADA  +  ADB  +  ADC  +  BDC. 

Using  (X  +  B  +  C)DC  =  Cand  CD(A  +  B  +  C)  =  C,  we 
obtain 


ADC ♦  BDC=  CD  A  4  CDB. 
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Therefore. 

ADA  +  ADB  +  ADC  +  BDC  =  ADA  +  ADB  +  CDA  +  CDB 

=  (/t  +  C)D(A  +3).  □ 

Theorem  3:  Consider  the  robustness  function  G22(s)  of  the 
asymptotic  FSRO  control  system  of  Theorem  2.  Suppose  that  the 
nominal  closed-ioop  is  stable  and  C(sl  -  A)~'Kj has  no  zeros  in 
the  CRHP.  Then,  as  7  -»  oo, 

G::(ju)  -  .VOW  -A  +  BKC )~ 1 A  1 

pointwise  for  all  co  £  Si. 

Proof:  For  an  FSRO  compensator,  K(s)  is  given  as 
K{s)  =  Kc(sl -A  +  BKC  +  KjCY'Kf. 

Thus,  the  robustness  function  G22(s)  becomes 

Gl:(s)  =  NoSf  -  i\t>BK(I  +  GK)-'C<t>M 

=  N(sI-A+  BKfsI  —  A  +  BKC  +  K/C)~  'A/C) '  'M 
=  N(sl-  A  +  BKC)  “  '(si- A  +  BKC  +  KfC) 

■  (sI-A  +  KjC)-'M 

=  N(sl-A  +  BKcy'M+  N(sl  -A  +  BKC)-'BKC 

■  (sI-A  +  K,C)-'M 

where  Lemma  3  is  used  in  the  third  equality,  and  all  the  inverses 
exist  along  the  imaginary  axis  since  the  closed-loop  is  stable.  Now 
M  =  KjP  since  M  is  column-similar  to  Rf.  Let  K /  =  (l/-y )Kf. 
From  Kf  =  yKf  +  Kfo(y),  we  obtain  M  =  KfP  -  (1/ 
y)K/oh)P-  Then, 

Kc(sI-A+K,C)'M 

=  Af[0-o7A/(/  +  yC<t>K,)  -  >C*  ]  (k/P  -  i  Kfo(y)P^j 

=  Kc4>RjII -y{I+  yC<t>Rf)~xC<t>R/]P 

- Kc\<}>- <t>yKf(I  +  yC<t>Kf)~'C<)>]  -  K/0(y)P 

7 

=  Kc  <t>Kf(  I  +  y  C<t>K,)  ~'P - - Kc  </>  K,a  (7 )  P 
7 

AKc<t>^f(I+yC<t>Hf)-'C<i>K/0{y)P 

where  <t>  =  (si  -A)  For  the  time  being,  suppose  that  A  has  no 
eigenvalues  on  the  imaginary  axis.  Since  CoK,  has  no  CRHP 
zeros,  q(C4>K/)  >  0  along  the  imaginary  axis.  Using  the  singular 
value  inequality  used  in  the  proof  of  Theorem  2,  we  can  show  that 

lim  o(C<t>Kf)  =  o(C<t>l<,). 

7-!» 

Again, 

a(yC<t>Ky)  —  1  £  u(  /  +  7  C<t>K/)  £  a(yC<t>Kf)  +  1 . 

Thus.  q(l  +  yC<t>K/)  -*  oo  as  y  -»  oo  since  q(yC<t>Kf)  = 
yq(C4>Kf)  ->  oo  as  y  -*  oo.  Then,  as  y  -»  oo, 

d(I+yC<bKf)-'  =  q-'(I+yC't>K,)^0. 

Also, 

6{(  /  +  yC<t>Kf)  -  'C<HA7„(7  )l  Sd(l  +  yC<t>K,)  'a(C<i,K/o(y)) 

.  .  ,  /  1  \ 


as  7  -*  oo.  Therefore,  we  see  that 

Kc(juil  ~  A  +  KfC)~lM-*Q  as7-*oo 

and 

lim  G22(ju> )  =  Nijeil  -  A  +  BKC)~ 1 M 

for  all  us  £  Si.  Finally,  we  need  to  consider  the  case  for  which  A 
has  some  eigenvalues  on  the  imaginary  axis  so  that  <t>  -  (si  - 
A)~'  is  not  defined  for  us  £  Si.  However,  this  difficulty  can  be 
avoided  as  follows.  Let  A '  =  A  +  KaC  be  a  matrix  without 
eigenvalues  on  the  imaginary  axis.  Since  (A,  C)  is  assumed  to  be 
observable,  there  always  exist  such  a  K0.  Then, 

si- A  +  K/C=sl  -  A  '  +(K/+ K„)C  =  sl  -  A  '  +  KjC 

where  K  'f  =  y Kf  +  Klo(y)  +  K0  =  yRf  +  K  '0(y).  Then,  we 
can  apply  the  same  procedure  as  above.  □ 

The  robustness  function  G22(jus)  given  in  Section  II-C  deter¬ 
mines  the  robustness  of  the  FSRO  control  system.  However,  N(sl 
-  A  +  BKC)~'M  is  indeed  the  robustness  function  of  a  state- 
feedback  regulator  subject  to  AA.  (It  can  be  shown  easily  by 
constructing  a  TITO  system  for  the  state-feedback  regulator.) 
Therefore,  Theorem  3  implies  that  a  regulator  with  an  asymptotic 
full-order  estimator  recovers  the  robustness  of  a  regulator  with 
full-state  feedback.  Furthermore,  we  see  that  the  sensitivities  of 
asymptotic  infinite  poles  of  the  observer  part  do  not  contribute  to 
the  robustness.  Although  they  may  be  sensitive  to  A  A.  these 
infinite  poles  cannot  be  perturbed  to  the  CRHP  by  AA.  An  exact 
dual  of  the  above  asymptotic  property  exists  as  the  regulator  gain 
Kc  instead  of  A7 becomes  infinitely  large  (i.e. ,  Kc  =  $KC  where  (3 
-*  00,  and  Rc  is  chosen  as  N  =  QKC  where  Q  is_a  finite  matrix). 

If  p  >  /  and  M  is  full  rank,  we  cannot  find  a  Kf  such  that  M  = 
KjP.  Similarly,  if  q  >  m  and  N  is  full  rank,  there  does  not  exist 
Kc  such  that  N  =  QKC.  Therefore,  the  above  asymptotic  property 
cannot  be  obtained  if  p  >  I  and  q  >  m.  i.e.,  the  rank  of  the  input 
matrix  B  and  the  output  matrix  C,  which  are  assumed  to  be  full 
rank,  limit  the  rank  of  M  and  N  in  applying  the  asymptotic,  partial 
desensitization  procedure  described  above. 

In  Theorem  3,  the  conditions  seem  to  be  redundant  since  we 
only  use  the  fact  that  A  -  BKC  and  A  -  KfC  do  not  have  pure 
imaginary  eigenvalues  and  C(sl  -  A )~'Kf  is  not  singular  along 
the  imaginary  axis.  However,  the  closed-loop  system  should  be 
stable  for  any  robustness  issue  to  be  meaningful.  Also,  C(sl  - 
A)'  'A/ should  not  have  CRHP  zeros  since  those  zeros  determine 
the  asymptotic  closed-loop  poles.  In  the  next  section,  the 
asymptotic  observer  with  closed-loop  stability  can  be  obtained  via 
a  Kalman-Bucy  filter  (KBF)  design. 

D.  Asymptotic  LQG  Design  Synthesis 
Consider  a  KBF  problem  stated  as 
x  =  Ax+  Bu  + 
y  =  Cx  +  vt) 

x  =  Ax+  Bu  +  Kj(y  -  Cx) 

with  a  cost 

7=  hm  if  (x-x)T(x-x)  tfrj 

where  £  and  q  arc  unit-strength  Gaussian  noises  of  appropriate 
dimensions.  Under  the  assumption  that  (.9,  R)  is  stabilizable  and 
(A,  C)  is  detectable,  the  optimal  filter  gain  K,  is  obtained  as 
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whore  A,  in  the  positive  definite  solution  to  a  Riccati  equation 

i\d  rx ,-i i' r r  rr  r  - ~  ycro:  =  o. 

L  ‘ 

Theorem  4:  If  there  exists  a  full  rank  R  G  U”  ' 1  such  that  M  is 
column-similar  to  R,  (--1,  R)  is  siabilizable.  and  C{si  -  .-I)  'R 
has  no  CRHP  zero.  then,  as  v  —  0,  the  asymptotic  properties  of 
Theorem  3  are  obtained. 

Proof:  As  e  ->  0,  K,  —  (l/v)R  If  where  IF  is  a  nonsingular 
matrix,  if  (.4,  R)  is  stabilizable.  (.4,  C)  is  detectable,  and  C(s/  - 
<4)'‘A  is  minimum  phase.  The  proofs  of  these  properties  are 
given  in  |3|.  (9).  [15]  and  not  repeated  here.  Let  y  =  (l/e).  Then 
we  see  that  the  asymptotic  gain  assumes  the  form  K,  as  A/  = 
yAH'  +  Kf0  where  ( 1  /-y) A'/„  -»  0  as  y  -*  oo.  Let  Kf  =  AH’. 
Then,  if  A/  is  column-similar  to  R.  there  exists  a  matrix  P  such 
that  M  =  RP  and  M  =  RP  =  Kf\V~'P.  Therefore.  A/  is  also 
column-similar  to  AV.  Finally,  the  closed-loop  stability  is  automat¬ 
ically  satisfied  by  a  LQG  design.  □ 

Suppose  that  s  =  c0  is  a  zero  of  (A,  M,  C).  Then  the  system 
matrix 


is  rank  deficient,  and  there  exist  vectors  p,  G  Si" * 1  and  p2  G 
Jlpx  i  such  that 

(z0I-A)px  +Mpi  =  0,  Cpi  =0. 

Since  span  {A/}  C  span  {A},  we  can  find  a  vector  p,  G  Jj'*1 
such  that 

(z0I~A)pl  +  Rp,  =  0,  Cp{  =0. 

This  implies  that  the  zeros  of  (A,  M,  C)  are  also  the  zeros  of  (A, 
R,  C).  In  other  words,  C {si  -  A)~ '  R  is  minimum-phase  only  if 
C(sl  -  A)~'M  is  minimum-phase.  Therefore,  we  can  apply  the 
asymptotic  LQG  synthesis  only  for  minimum-phase  C(sl  - 
A  )~  'M. 

The  dual  of  Theorem  4  is  obtained  by  considering  a  LQR 
problem  but  is  not  treated  here.  The  importance  of  Theorem  4  is 
that  there  exists  a  direct  structural  relationship  between  the 
parameter  variation  and  the  optimal  LQG  weighting  matrices  for 
robustness  (A/  and  R  for  the  above  asymptotic  procedure).  As  far 
as  an  observer  design  is  concerned,  these  results  can  be 
interpreted  as  follows:  1)  given  M,  which  gives  partial  informa¬ 
tion  on  the  structure  of  AA,  we  can  choose  y2RRr  as  the 
covariance  of  the  process  noise  (a  natural  choice  is  A  =  M  if  p  = 
1);  and  2)  as  y  -*  oo,  the  stability  robustness  is  determined 
through  the  regulator  gain  Kc  (Theorem  3)  while  the  nominal 
observer  poles  become  insensitive  to  A/4.  Although  the  robust¬ 
ness  of  the  LQ  pan  depends  on  the  regulator  gain  Kc  and  AA,  a 
general  theory  is  not  available  for  finite  regulator  gains.  However, 
it  is  possible  to  apply  the  asymptotic  procedures  to  the  regulator 
and  observer  pan  at  the  same  time.  Using  a  similar  method  to  the 
one  used  in  Theorem  3,  we  can  prove  that  absolute  robustness 
(i.e.,  G22  =  0)  is  obtained  asymptotically  if  M  and  A/ are  used  for 
the  observer  and  regulator  design,  respectively.  Finally,  we  can 
show  that  the  nonuniqueness  of  the  I/O  decomposition  is 
irrelevant  to  the  asymptotic  LQG  design.  For  example,  if  ML. \  is 
an  I/O  decomposition  of  A/l .  so  is  A/7’1  7",  ' LT ,  1  73 /V.  Then,  we 
see  that  an  R  satisfying  the  conditions  of  Theorem  4  for  A/  also 
satisfies  them  for  MT,. 

E.  Remarks  on  the  LQG/LTR  Method 

This  section  shows  that  LQR/LTR  is  a  special  case  of  the 
asymptotic  weighting  strategy  discussed  in  the  previous  section. 
Two  types  of  LQG/LTR  procedures,  which  are  dual  to  each  other, 
have  been  studied  in  various  works  [l]-|4|.  One  is  sensitivity 


recovery  where  the  loop  transfer  function  ot  the  KBF  is  recovered 
at  the  output  by  an  asymptotic  regulation  of  the  output  v  =  Cx. 
The  other  is  robustness  recovery  where  the  loop  transter  function 
of  LQR  is  recovered  at  input  by  an  observer  design  based  on  an 
asymptotic  KBF  with  a  white  noise  injected  at  the  input  (i.e..  R  = 
B).  Note  that  observer  insensitivity  to  parameter  variations 
requires  that  M  be  column-similar  to  R.  Therefore,  in  the  LQG/ 
LTR  method,  the  asymptotic  finite  poles  are  guaranteed  to  be 
insensitive  only  to  either  input-similar  or  output-similar  parameter 
variations.  For  example,  the  robustness  recovery  procedure 
makes  the  observer  pan  insensitive  to  input-similar  variations, 
and  the  stability  robustness  to  such  variations  is  solely  determined 
by  the  regulator  pan.  Thus,  the  LQG'LTR  procedure  works  well 
if  the  structure  of  parameter  variation  is  related  to  the  structure  of 
the  input  matrix  B  or  the  output  matrix  C.  i.e..  AA  is  similar  for 
the  (.4,  B,  C).  However,  if  A.4  =  —MLN  is  neither  input- 
similar  nor  output-similar,  all  the  closed-loop  poles  are  penurbed 
when  the  LQG/LTR  procedure  is  used,  and  there  is  no  guaranteed 
stability  robustness  to  the  parameter  variation  in  spite  of  the 
guaranteed  stability  margins.  It  is  noted  that  the  matching 
condition  ot  the  Lyapunov-function  methods  [23|,  [25]  requires 
that  parameter  variations  be  similar.  This  observation  may  imply 
that  robust  stabilization  with  similar  parameter  variations  is  less 
complicated  than  with  other  types  of  parameter  variations. 

The  relationship  between  the  error  models  of  Section  II-A  and 
similar  parameter  variations  is  now  discussed.  For  an  input- 
similar  variation  (A/  =  BP) 

in  =Sn  ~  8\zE(I  +  g22 L) "  'g2| 

=  C6B  -  CoBPL  ( 7  +  N6BPL  V'N<t>B 
=s„(f + PLNAB)  -  L 

The  penurbed  plant  is  expressed  as  a  multiplication  of  the  plant 
and  a  transfer  function  placed  at  the  input,  which  is  similar  to  the 
conventional  uncertainty  model  E,„(s)  discussed  in  Section  II-A. 
Similarly,  an  output-similar  parameter  variation  has  a  structural 
similarity  to  a  conventional  uncertainty  model  placed  at  the 
output.  It  is  noted  that  these  multiplicative  forms  cannot  be 
obtained  if  A/1  is  not  similar.  We  consider  the  robustness  function 
G22  associated  with  the  input-similar  parameter  variations  to 
examine  the  implications  of  this  structural  similarity.  Suppose  that 
A7  is  input-similar.  Then  there  exists  a  matrix  P  such  that  M  = 
BP,  and 

G2;  =  A'0  M-  N0  BK  ( /  -r  Co  BK )  ~  1  C<SM 
=  N<i>BP  -  N$BK{I  +  CoBR)~  'C<i>BP 

=  N4>B{l  +  KG)iP 

where  G  =  Cd>£.  Since  P  and  N<t>B  arc  arbitrary  for  general 
input-similar  variations,  the  robustness  to  input-similar  variations 
is  optimized  by  minimizing  d[(/  +  A'G)'1].  or  equivalently 
maximizing  q\(I  +  A'G)[.  Also,  the  robustness  to  output-similar 
parameter  variations  can  be  improved  by  increasing  a|(7  +  A'G)). 
These  singular  values  arc,  in  fact,  the  robustness  measures  used  in 
the  conventional  singular-value  methods  [10],  [11].  From  this 
observation,  we  can  conclude  that  the  robustness  to  the  uncer¬ 
tainty  modeled  at  the  input  (at  the  output,  respectively)  is 
equivalent  to  the  robustness  of  the  class  of  input-similar  (output- 
similar.  respectively)  variations.  The  close  relationship  between 
similar  parameter  variations  and  conventional  multiplicative 
uncertainties  also  shows  the  limitation  of  the  existing  multiplica¬ 
tive  uncertainty  models  in  modeling  parameter  variations  that  only 
similar  variations  are  properly  represented  by  these  models. 
Therefore,  the  LQG/LTR  methods  based  on  these  uncertainty 
models  may  fail  when  applied  to  a  general  parameter  variation 
problem. 
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IV.  Numerical  Examples 

We  consider  the  example  presented  in  |IS|.  The  plant  is  given 


A  = 

"  -  1 

0 

0 

-2 

1 

O  O 

B  = 

I 

_  s 

0 

0 

1 

C—  (3  3  31  AC=[f  0  0], 


The  transfer  functions  of  the  nominal  system  and  perturbed 
system  are  given  as 


G(s)  = 


6 _ 

(s+  l)(s  +  2)(s+3)  ’ 


G(s)  = 


6  +  <(s  +  2)(s+3) 
(5+  I  )(5 +2)(5  +  3)  ' 


respectively.  It  is  noted  that  an  arbitrarily  small  parameter 
variation  creates  a  pair  of  zeros  at  infinity.  For  e  >  0.  these  zeros 
are  located  at  5  =  ±00  along  the  real  axis.  If  e  <  0.  then  we  have 
two  complex  zeros  at  s  =  —2.5  ±  yoo.  Although  these  zeros 
affect  very  little  the  low-frequency  characteristics  of  the  plant,  the 
plant  becomes  very  uncertain  in  the  high  frequency  range  since 
they  introduce  large  gain  and  phase  variations. 

By  using  the  state  augmentation  procedure  described  in  Section 
II-C  we  obtain  the  augmented  system  such  that 


’  -I 

0 

0 

0  ' 

1 

r—  •< 

0 

Aa  = 

0 

0 

-2 

0 

0 

-3 

0 

0 

-2 

1 

c:= 

0 

0 

3  +  e 

3 

3 

—  r 

0 

T 

’0" 

‘  -1  ' 

0 

NT= 

0 

0 

0 

1 

0 

where  r  =  1000  for  computation.  For  the  LQ  pan,  the  cost  is 
given  by 

1  fr 

J  =  hm  —  1  (zTz  +  p2uTu)  dt 

r-oc  /  j  0 

where  the  weightings  are  chosen  as  z  =  y  =  Cx  and  p  =  0.1. 
This  produces  three  nominal  regulator  poles  5  =  -  4.52  and  s  = 
—  2.26  ±  y  2.87.  For  the  filter  pan,  the  weighting  matrix  is  given 
as 

Q/=W/„MMT+  w1^ Ba B  T 

If  w/m  =  0  and  tvyj  -*  00 ,  then  we  have  a  robustness  recovery 
procedure,  which  is  one  of  the  LQG/LTR  procedures.  The 
weighting  iv^,  =  0  and  w/m  -*  00  corresponds  to  the  asymptotic 
LQG  based  on  the  structure  of  the  parameter  variation. 

Figs.  2-4  show  how  the  sensitivity  of  the  LQG  system  is 
reduced  by  using  the  structure  of  the  parameter  variation  in 
selecting  the  weighting  matrix  of  the  filter  Riccati  equation.  The 
far-left  poles  induced  by  the  state  augmentation  arc  not  shown  in 
these  iigures.  In  Fig.  2  the  root  loci  are  plotted  for  the  robustness 
recovery  procedure  when  (f  |  <  0.5.  The  filter  poles  are  shown  to 
be  very  sensitive  as  pointed  out  in  (I8|.  The  sensitivity  of  the 
LQG/LTR  procedure  is  due  to  the  sensitivity  of  the  filter  because 
M  is  not  similar  to  Ba.  Fig.  3  shows  that  these  poles  become  less 
sensitive  when  ny,„  increases  from  0.0  to  10.0.  The  sensitivity  is 
considerably  reduced  by  the  addition  of  the  weighting  i v-  SIX!  T 
associated  with  the  parameter  variation  440.  Fig.  4  also  shows 
that  the  filter  poles  are  completely  insensitive  when  M  is  used 
instead  of  Ba  for  the  weighting  matrix.  In  this  example,  the  three 
filter  poles  s  =  -  I,  -2,  -3  are  the  open-loop  poles.  These 
poles  correspond  to  stabilizable  but  uncontrollable  modes  of  the 
PJir  (4Jt  A /  )  ot  the  augmented  system.  They  are  the  decoupling 


zeros  of  C„(sl  Aa)  'A/,  and,  by  Theorem  2,  are  insensitive  to 
A4„  as  wfm  -*  00. 

Note  that  the  regulator  poles  are  also  insensitive  in  Fig.  4.  By  a 
direct  evaluation  of  K(s)  we  can  show  that  K{s)  -*  0  as  w/m  -*  0. 
This  implies  that  the  optimal  robustness  for  this  example  is 
obtained  without  feedback  rather  than  using  high  feedback  gain  It 
is  because  the  output  matrix  Cof  the  original  problem  is  assumed 
to  be  totally  uncertain  (i.e.,  only  M  is  used  for  the  filter-part 
weighting)  while  any  variation  of  C  does  not  perturb  the  poles  of 
the  plant.  This  result  is  valid,  if  trivial,  as  far  as  robustness  is 
concerned.  The  matrix  N can  be  used  for  the  asymptotic  reeulator 
without  resulting  in  a  trivial  A’(s)  as  shown  in  Fig.  5.  In  this  case 
the  use  of  N corresponds  to  the  assumption  that  the  first  column  of 
the  augmented  A  matrix  is  uncertain.  The  regulator  slate 
weighting  matrix  now  used  is 

Q,  =  »*■■„ N  TN  +  C TaC„ . 

If  iv, r  — >  00  and  iv, „  =  0.  then  the  LQG  LTR  robustness  property 
is  obtained  at  the  system  output.  If  »v,v  =  Oand  iv,„  —  00,  then  the 
regulator  is  made  insensitive  to  the  parameter  error  in  C. 

V.  Conclusions 

In  this-  paper,  the  plant  uncertainties  are  divided  into  two 
groups:  unstructured  uncertainties  such  as  unmodcled  dvnamies 
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Fig.  5.  Root  loci  of  the  asymptotic  LQG  design  using  N. 


and  structured  uncertainties  such  as  parameter  variations.  These 
two  groups  are  compared  to  each  other,  and  the  inadequacy  of  the 
current  modeling  methods  for  parameter  variations  is  discussed. 
Based  on  the  I  O  decomposition  of  parameter  variations,  a  new- 
modeling  method  :s  then  introduced  to  solve  this  problem.  An 
asymptotic  LQG  procedure  is  proposed  for  desensitizing  the 
closed-loop  poles  when  parameter  variations  are  present.  The 
class  of  parameter  variations  to  which  an  asymptotic  LQG  is 
robust  (or  insensitive)  is  determined  by  the  structure  of  the 
weighting  matrices.  This  implies  that,  if  the  structure  of  a 
parameter  variation  is  known,  then  the  robustness  can  be 
improved  by  this  asymptotic  LQG  procedure.  As  shown  in 
Section  IV  the  robustness  of  the  overall  system  is  not  affected  by 
the  filter  part  but  determined  by  the  controller  part,  which  is 
independent  of  the  asymptotic  filter  gain  Kf.  Therefore,  for 
parameter  variations,  the  LQG  synthesis  for  robustness  reduces  to 
a  synthesis  problem  of  the  nonasy mptotic  LQ  controller  (or  the 
Kalman-Bucy  filter  in  the  dual  problem).  It  is  then  shown  that  the 
LQG/LTR  procedure  is  a  special  case  of  this  asymptotic  ap¬ 
proach.  In  fact,  the  guaranteed  stability  margins  of  LQG/LTR 
controllers  are  not  meaningful  for  the  general  class  of  parameter 
variations  considered  here  but  for  the  even  smaller  class  of 
variations  such  that  the  parameter  structures  are  similar  to  the 
input  matrix  or  output  matrix  of  the  plant. 
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